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1 Introduction 

This paper is a study in the geometry and topology of anti-self-dual Yang-Mills moduli 
spaces. Although moduli spaces were studied extensively for their own sake in the 
1970s and early 1980s, in the late 1980s and early 1990s such studies were primarily 
a means to an end. Moduli spaces were studied to compute Donaldson invariants, 
and Donaldson invariants were computed for their applications to classifying smooth 
4-manifolds. Seiberg-Witten theory has, of course, made that last road obsolete. It is 
believed that the Seiberg-Witten invariants determine the Donaldson invariants, and 
the former are far easier to handle. 

However, Seiberg-Witten theory has opened up new uses for Donaldson theory. 
From Seiberg-Witten theory, we now have a much better understanding of Donald- 
son invariants. Instead of using moduli spaces as a tool for computing Donaldson 
invariants, we can now use Donaldson invariants as a tool for understanding moduli 
spaces. This paper is an exercise along those lines. 

A basic problem in four- dimensional gauge theory is to understand the "simple 
type" condition. In Donaldson theory, a manifold is said to have simple type if its 
Donaldson invariants satisfy a certain recursion relation ([KM]; see (1.2) below). In 
Seiberg-Witten theory, a manifold has simple type if it has no Seiberg-Witten classes 
of nonzero index. The two notions of simple type are believed to be equivalent, so 
that theorems proved about one form of simple type should yield information about 
the other. 

In this paper we work with the Donaldson theory sense of simple type, examining 
what simple type implies about the geometry of anti-self-dual moduli spaces. In two 
ways — with intersection theory and with de Rham theory — we measure the extent to 
which the boundary region of moduli space contributes to the simple type recursion 
relation. Our results imply that the anti-self-dual moduli spaces associated to any 
manifold of simple type have a very surprising interior geometric structure. Widely 
satisfied sufficient conditions are known for a manifold to be of simple type [KM] , and 
it is conjectured that indeed all 4-manifolds with 6_|_ > 1 are of simple type. Hence 
our results apply to a great many manifolds. 

Simple type says that the k + 1st moduli space A^fc+i, intersected with certain 
varieties, has the homology of a certain multiple of the k-th moduli space Aik- Our 
intersection theory approach is based on the construction in [SI] of a geometric rep- 
resentative of /i of a point (see below). Using this representative we show that the 
portion of (a small perturbation of) Aik+i near the boundary contributes 6/64 of the 
homology required for simple type, regardless of the topology or geometry of the un- 
derlying 4-manifold. (For a quick, heuristic derivation of this 6/64, see [S2].) Simple 
type thus reduces to a statement relating Aik to nontrivial structure in the interior 
of M.k+1 (unless our small perturbation of A4k+i is drastically unfaithful topologi- 



cally, which seems highly unhkely). This is surprising, since the only known relations 
between Ai^ and Aik+i involve Taubes patching, and relate A^^ to the boundary of 

Mk+i- 

Our second approach is to use differential form representatives of the images of the 
/i map. One then takes the wedge product of these forms and integrates over Aik+i- 
If we restrict the domain of integration to a neighborhood of the boundary of A^^+i, 
we can reinterpret the simple type condition in terms of the integral of a certain 
8-form over a submanifold that represents the space of "bubble parameters" in the 
neighborhood of a background connection in A^^. We show that, again independently 
of the topology and geometry of the base manifold, this integral has precisely 1/8 the 
value of what one would naively expect if simple type were captured purely by a 
neighborhood of the boundary. Thus again simple type becomes a statement about 
the nontrivial structure of the interior of moduli space. 

It is curious but no contradiction that the two approaches yield the different nu- 
merical answers 6/64 and 1/8. While the Donaldson polynomial is topological, hence 
independent of the choice of geometric or de Rham representatives, the contribution 
of each region of moduli space is geometric, and need not be the same for two different 
representatives. Indeed, the de Rham and geometric representative calculations not 
only disagree on the contribution of the boundary region, but also disagree on how 
close to the boundary the essential contributions are. In terms of the small parameter 
L, described below, the geometric representative picks up contributions from bubbles 
of size 0{L'^)^ while the bulk of the support of the de Rham representative is on 
bubbles of size 0{L). 

To state our results more precisely, we must review the definition of the Donaldson 
invariants, and of simple type. Let N be an oriented 4-manifold, let G = SU{2) or 
5*0(3), and let Bl be the space of irreducible connections (up to gauge equivalence) 
on Pfc, the principal G bundle of instanton number k over A^. Let Aik '^ B^ he the 
space of irreducible connections on Pk with anti-self-dual curvature, modulo gauge 
transformations. We will frequently omit the index k. 

Donaldson [Dl, D2] defined a map /i : Hi{N,Q) -^ H^~'{Bl,Q), i =0, 1, 2, 3, 
whose image freely generates the rational cohomology of Bl- Donaldson invariants 
are then defined by pairing the fundamental class of Aik with products of fj, of the 
homology classes of A^, where k is chosen so that the dimensions match. Formally, 

for elements [Si], . . . , [S„] G H^{N), we write 

z}([Si] . . . [s„]) = MPi]) - ■ ■ ■ - /i([s„])[Atfc]. (1.1) 

Now let X be the point class in Ho[N), and let u be any formal product of classes in 
H^:{N). The simple type condition is that, for all u, 

D(x^iu)=AD(iu). (1.2) 



Of course, the "fundamental class of A4k" is usually not well defined, as A4k is 
typically not compact. The usual way to make sense of (1.1) and (1.2) is with geomet- 
ric representatives. One finds finite-codimension varieties Vs in Bl that are Poincare 
dual to /i([S]). One then counts points, with sign, in \/si fl ■ ■ ■ fl Vs„ r\A4k- To make 
a topological invariant one must show that the number of intersection points is inde- 
pendent of auxiliary data, such as the metric and the choice of representatives. This 
requires careful analysis of the bubbling-off phenomena that make Aik noncompact. 

To compute the left-hand side of (1.2) we need a variety that represents fi of the 
point class x. In general //(x) is not an integral class in H'^{B*) and strictly speaking 
has no geometric representative. However, — 4/i(x) is an integral class. For p E N , 
let 

Up = {[A\ G BXj^]\F2 is reducible at p}. (1.3) 

Here F^ = {Fa — *F4)/2 is the anti-self-dual part of the curvature F4, and by 
"reducible at p" we mean that the components F~Ap) are all colinear as elements of 
the Lie algebra of G. In [SI] it was shown that Vp is a geometric representative of 
— 4yu([p]). The simple type condition can therefore be rewritten as 

#(A^fe+i n z/p n z/, n v^) = 64#(A^fe n v^), (1.4) 

where p and q are any two points in N, uj is an arbitrary formal product of homology 
cycles of A^, and V^i is a geometric representative of //(cu). 

More formally, one can write 

[Mk+ir\Upr\Ug] = u[Mk\. (1.5) 

Strictly speaking, the left hand side is an element of H^{Bk+i), while the right hand 
side is in H^,{Bk). However, Bk and Bk+i are homotopy equivalent spaces, and their 
homology classes may be identified. 

The first question studied in this paper is this: Suppose p and q are extremely 
close points in N, separated by a distance 2L. How many of the points on the left 
hand side of (1-4) Hg near the boundary of Aik+i? The answer is quite simple: 

Theorem 1.1 Let N be a compact oriented Riemannian four-manifold of arbitrary 
topology and geometry and let 4k > 36+ + 5. Fix a coordinate patch on N, and let p 
and q be the points (±L, 0,0,0). Fix u G Sjm*{H^:N), K > 0, and a G (0,2). Let 
■^t+i be the portion of the (perturbed) moduli space Mk+i consisting of a background 
of charge k and a charge-one bubble of size A < KL'^ . For generic choices of geometric 
representatives V^j of fi{uj), and for all sufficiently small L, the intersection number 
of Ml_^.l with V^nUpH Ug is 6D{uj). 

The perturbed moduli space Aik+i is constructed, and the genericity conditions 
specified, in §§3-4. This theorem is restated, more precisely, as Theorem 4.1. In this 



theorem, and throughout §§2-4, we assume k is in the indicated "stable range" to 
avoid contributions from lower strata of the compactified moduli space. 

Simple type thus reduces to a statement that the interior of A4k+i H UpH Ug be 
homologous to 58 copies of A^^. This is striking, since in general very little is known 
about the interior of Aik+i- As noted earlier, the only known embeddings of A^^ into 
Aik+i involve Taubes patching, and have an image near the boundary of A^^+i. 

On the level of differential forms, the de Rham-theoretic version of the /i-map is 
represented by a map 

^,:n^{N)^n^{Bl^,), 2 = 0,..., 4; (1.6) 

the argument of fid is a form representing the Poincare dual of the argument of /i. 
In particular, fi{x) is represented by a 4-form fid{^) £ -f^^l-B^+i) for any uj G Q'^{N) 
with X^,^ 0^ = 1. 

One can write down an explicit formula for such a representative fid{^) by ap- 
pealing to Chern-Weil theory on the canonical S'0(3)-bundle V — > 'Bfc_|_i x M (see §5). 
Furthermore given p G M, if we replace u by 6p, a delta-form supported at a point 
p, then the resulting form on Bl_^^ is still de Rham cohomologous to a form obtained 
using smooth u (although there is an important difference that we will discuss later). 
Let us write fid{p) '■= f^di^p)- Note that for smooth oo G VL'^{N) we have 

J N 

If Z is an eight-dimensional cycle in i3^_,„i, then since the cohomology class oi Hdip) 
is independent of p, for any points p,q E M we have f^ IJ'd{p)^l^d{p) = fz fJ'd{p)/\fJ'd{(l), 
and moreover this integral depends only on the homology class of Z. 

Pretend, for a moment, that the moduli spaces Aik+i,-Mk are the total space 
and base space of a compact, connected, oriented fiber bundle n : A4k+i ^ Alfc; the 
fibers would then be mutually cohomologous compact submanifolds Z C Aik+i- For 
any form G Q^°^{Aik), we would have a product formula 



l^dip) A fidiq) A 7r*0 = / fidip) A fidiq) / (1-8) 

Mk+i \Jz J \JMk / 

(assuming compatible orientations), so the simple-type condition (1.2) would be 
equivalent to 

[ fidip) A f^diq) = '^. (1.9) 

Jz 

In reality the moduli spaces are not compact and there is no such global fibration. 

However, from the current understanding of Donaldson invariants one might speculate 

that the relevant integrals are supported in a region near the ideal boundary of Aik+i- 
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Of course a random de Rham representative of a cohomology class can be supported 
wherever it likes, but /id(-) is not random, and there is evidence that its properties near 
the boundary of moduli space do indeed capture a lot of cohomological information. 
For example, consider the five- dimensional moduli spaces of 1-instantons over simply 
connected manifolds with 6+ = 0. In such cases the inverse of a collar map gives 
embeddings Tx : N ^ Aii G B* for A sufficiently small (the image of Tx consisting 
of instantons of scale A), and one has Donaldson's theorem that the composition 
T^o n : H2iN,Z) -^ H^{N,Z) is precisely Poincare duality ([DK Corollary 5.3.3]). 
The corresponding assertion in de Rham cohomology would be that r^ o/^d : fl'^{N) -^ 
Q'^{N) induces the identity on cohomology. But in fact in this context one can show 
that limA^o t"! ° f^d is already the identity map on the level of forms, in all degrees 
([GP3]). This fact extends to more general moduli spaces A^^ near the "maximal 
bubbling" ends consisting of instantons with k distinct charge-one bubbles. 

If we consider that portion A^'^+i ;^ of Aik+i,\o near the highest-dimensional 
boundary stratum Ai^ x N, there is indeed a fibration Ai'f,_^_i Xf ^ -^1 whose fibers 
can be identified with subsets of an 8-dimensional space of framed ASD connections 
on R^. (Here Ai'j. denotes the space of non-concentrated irreducible /c-instantons, 
and M-'i^^i Xq the space of {k + l)-instantons with only a single "bubble", of charge 
one, and scale less than some small number Aq.) The typical fiber Z = Za„ is itself a 
bundle over (0, Aq) x A^ for some small Aq, whose fiber over (A, p) G (0, Aq) x A^ is the 
space of "gluing parameters" Hom5o(3)(A^T*A^, Ad Pfc) ^ ^0(3) (see [DK, p. 324]). 
Since M'^+i Xo i^ ^^ch a large portion of the end of Mk+i, one might then expect that 
an approximate version of (1.9) holds under the assumption of simple type. 

What we show below is that (1.9) fails in a very precise way: independent of the 
topology and geometry of A^, if Aq is small enough and dist(j9, g) is small compared 
to Aq (but nonzero), then 

f /id(p)A/id(g)^^ (1.10) 

under certain technical but intuitively reasonable assumptions about the fiber Zx^- 
Taking a limit as g — * j9 and then as Aq -^ 0, the integral above approaches an integral 
over the space of framed instantons on R'^, and this latter integral has the precise 
value 1/2. 

At this stage the reader may wonder why we do not simply take p = q in (1.10). 
The reason is that for purposes of integration, the fid{p) turn out to be more singular 
than the representatives /id(a;) for smooth u. Were we to set p = q in (1.10), 1/2 
would be replaced by 0. This discontinuity can be modeled by the following two- 
dimensional example. Let H be the upper half-plane {{x,X) G R^ | A > 0} and for 
each L G R let ^L : /J ^ (0, tt) be the usual polar-coordinate angle as measured from 
(L, 0) (so (IOl = {{x - L)dX - Xdx)/{{x - Lf + A^)). As forms on H, the (IOl are all 
cohomologous (in fact cohomologous to zero). However, j^dOo A dOo = 0, while for 
L > we have we have /^ d9o A dOi = n'^/2. Essentially, fid{p) A fid{q) behaves like a 
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quaternionic version of this example. 

The technical assumptions on the fiber Z are enumerated as (Z1-Z5) in section 7. 
The first three of these assumptions are known to be satisfied by the fiber constructed 
in [DK], but we have not determined whether the latter two are satisfied. These two 
are assumptions on the tangent space to T[^]Z, where \A\ G Z, and we prove that 
they are satisfied by a subspace of T'^j^M. (the "approximate tangent space") that 
we argue is close to T^j^Z. Because this step is only a plausibility argument, (1.10) 
implies one of two things: either fid{p) ^ f^d{(l) has most of its support in the interior of 
moduli space (or near higher codimension boundary strata), or the intuitive picture 
of the fiber Z is significantly wrong. Either way, the conclusion is surprising. 

Our second main theorem is then 

Theorem 1.2 Let N be a compact oriented Riemannian four-manifold of arbitrary 
topology and geometry and let k > 1. Assume that a typical fiber Zx^ of the fibration 
•^fc+i A ~^ -^'k satisfies f'Zl-ZSj of^7. Then for any p,q G A^, the form fj.d{p) Afj.d{q) 
is integrable over Zxq for Aq sufficiently small, and 

lim lim / /id(p) A fiaiq) = -, (1.11) 



while 

Ao^O 



lim / fidip) A fidip) = 0. (1.12) 

The convergence in (1.11)-(1.12) is uniform inp,q. Hence if Sp^i denotes a smooth 
4-form on N of total integral 1, supported in a ball of radius L about p, then (using 
(1.7)) 

^Ijm ( |nii / fid{5p,L) A /id((5p,L) J = 2' (1-13) 



By uniform convergence in (1.11) we mean that for all e > there exist Ai, 5{-) > 
such if < Ao < Ai and < dist(p, q) < S{\o) then the integral in (1.11) differs from 
1/2 by less than e. 

It is not necessary to take the limits in (1.11) completely independently, as long 
as q —^ p much faster than Aq -^ 0. If, for example, we require that dist(p, g) = 
const Aq^" for some a > 0, and then take a limit as Aq -^ 0, we again get 1/2. 

Note that if we held p and q fixed rather than taking lim^^p in (1.11), the limit 
as Ao ^ would necessarily be zero (since fid{p) A fid{q) is integrable). It turns out 
that ioT q ^ p the integrand in (1.11) is supported in a region in which A is of order 
dist{p,q). Thus if we wish to extend Hd{p) and ^Jid{^p,L) to forms on the Uhlenbeck 
compactification of A^, with lim^^^o /^^(^p.l) = l^dip) in a distributional sense, then 
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fid{p) ^f^d{p) should be viewed as the sum of a delta-form supported on the boundary 
of moduli space and a smooth form supported away from the boundary. 

Note also that Theorem 1.2 does not require k to be in the "stable range", unlike 
Theorem 1.1. However, Theorem 1.2 is most interesting for k in the stable range, since 
only then can the Donaldson invariant D([Si] ■ ■ ■ [S„]) be expressed as a topologically 
invariant integral /_^ jJ'dij^i) A ■ ■ ■ A fiction)- 

The rest of this paper is organized into two main parts, with §§2-4 devoted to 
proving Theorem 1.1 and §§5-10 devoted to proving Theorem 1.2. The strategy of 
proof, and the division of the paper, is as follows: 

Let A be a connection obtained by gluing a small bubble onto a background 
connection Aq. It turns out that the curvature of A is well approximated by the 
sum of the curvature Fq of Aq and the curvature Fgtd of a standard k = 1 instanton, 
viewed in the correct gauge. We are thus led to the following model problem: Given 
a connection [Aq] G Aik and two closely spaced points p and q, for how many triples 
(x. A, g) is the sum of the curvature Fq of Aq and the curvature Fstd of a standard 
instanton, centered at x with size A and gluing angle g, reducible at both p and q? In 
§2 we solve this model problem and show that, for generic Aq, the answer is 6. 

In §3 we construct a family of approximately ASD connections, based on an 
explicit gluing formula. This is a perturbation, which we denote A4fc+i, of the bound- 
ary region of A^^+i. We check explicitly that in this family the curvature is well 
approximated by Fq + Fstd- By linearly interpolating between Fq + Fstd and the ac- 
tual curvatures of connections in Aik+i, we show that corresponding to each generic 
Aq E Aik there are exactly 6 points in Aik+i Cl UpH Uq with A sufficiently small. 

In §4 we apply these results to show that if we consider only the boundary region 
of the (perturbed) moduli space, we obtain (1.5) with 6 on the right-hand side rather 
than 64, thereby completing the proof of Theorem 1.1. 

Ideally, one would then like to interpolate from Aik+i to Aik+i- This is quite 
difficult, as i>p and Ug are defined by pointwise conditions on the curvature. We know 
of no pointwise estimates relating the curvature of an almost-ASD connection to that 
of a nearby ASD connection. In order to make use of the integral estimates available 
in the literature one would have to replace Up and Uq by geometric representatives 
defined by integral conditions. While certainly possible, this is beyond the scope of 
this paper. 

We prove Theorem 1.2 by exhibiting fid{p) as a purely local piece /iioc(p) plus a 
nonlocal remainder. The local piece dominates in (1.11): as q ^ p the integral of 
Aiioc(p) A /Uioc(g') approaches a calculable integral on R^, with value 1/2, independent 
of Aq. (However, /iioc(p) A/iioc(p) = 0.) We establish (1.11)-(1.12) by showing that the 
integral of the remainder terms in fj,{p) A/i(g) approaches zero as Aq -^ 0, independent 
of p and q. Thus taking a limit as g — * p is relevant only to the purely local part 



of fidip) A fJ^diq) (and taking a limit as Aq ^ is relevant only to the nonlocal 
part); the delta-form behavior of fid{p) A f^dip) is due solely to /iioc(p) A /iioc(p)- The 
uniformity assertion in Theorem 1.2 follows from the proofs of (1.11)-(1.12), and the 
final assertion (1.13) then follows from (1.7). 

In §5 we begin our work on Theorem 1.2 by constructing the de Rham represen- 
tatives fJ^dip)- The splitup fj,d{p) = /^ioc(p)+ remainder is based on the "approximate 
tangent space" mentioned above. This approximation is built by lifting the action of 
certain vector fields on N to B*. In §6 we discuss this lifted action (the "canonical 
flow"), use it to define the approximate tangent spaces Ha, and discuss how close the 
T-Ca are to being tangent to Ai. We then exhibit the relation between a fiber con- 
structed from the canonical flow (whose tangent space is essentially the projection 
to T[A]-M. of approximate tangent space above) and the fiber constructed in [DK]. 
This digression is needed to motivate the technical assumptions (Z1-Z5) given and 
discussed in §7. In §8 we return to the main track, defining H\oc{p) and computing the 
limiting integral of /xioc(p) A //.ioc(9)- §§9-10 are devoted to a study of the remainder 
terms f^d{p) A/id(9) — f^iodp) A/iioc(Q')- In §9 we state the main technical theorem that 
yields the pointwise norm of these terms (Proposition 9.2), and use this theorem to 
establish that the integral of the remainder terms tends to zero as Aq — > 0. Finally, in 
§10, we prove Proposition 9.2. It is this section that contains the core of the analysis 
underpinning the validity of all the earlier calculations. The estimates in §10 require 
a weighted Sobolev inequality, proven in the appendix, that the authors have not seen 
elsewhere. 

The authors thank the 1994 Park City Mathematics Institute, where this work was 
begun, the National Science Foundation, and the Texas Advanced Research Program. 
We also thank Dan Freed, Tom Parker, Cliff Taubes, and Karen Uhlenbeck for helpful 
insights and criticism, and Margaret Combs for assistance with the figures. 



2 The Model Intersection Theory Calculation 

In this section we begin to compare the boundary region of Aik+i \~\ Vp \~\ Vq with 
M-ki by looking at a model problem. Pick a small neighborhood U of our manifold 
A^ and give it a flat metric with corresponding Euclidean coordinates. Let U be 
the corresponding ball in R^. We will denote points either by four real coordinates 
(a;°, . . . , x^) or by a single quaternionic coordinate x^ + ix^ +jx'^ + kx^. Let p and q be 
the points (±L, 0, 0, 0). Let Aq be an ASD connection on A^, expressed in a smooth 
gauge on U. 

An important notational tool is the identification of ASD curvatures with 3x3 
real matrices. Let Fq be the puUback, to U, of the curvature F^,, of an ASD connection 
on U. Relative to the standard oriented basis of A^T*R^ {uji = dx^dx^ — dx'^dx^, 



ijj2 = dx^dx"^ — dx^dx^, u^ = dx^dx^ — dx^dx^), Fq has at each point 3 Lie-algebra- 
valued components, and so can be viewed as a triple of 3-vectors. We package this 
triple of vectors into a 3 x 3 real matrix, which we denote Mat(Fo). More precisely, the 
first, second and third columns of Mat{Fo) are half the ui, 002 and 003 components of 
Fq, while the first, second and third entries of each column refer to the three directions 
in su(2), the Lie Algebra of SU{2). Aq is reducible at a point if and only if Mat{FQ) 
has rank 1 (or 0) there. 

Of course, this construction is dependent on gauge and a choice of basis for TN . 
A gauge transformation is a change of basis in su(2), and thus changes Mat(Fo) by 
left-multiplication by an element of 5*0 (3). A change of basis in TN changes Mat(Fo) 
by right-multiplication by an element of 5*0(3). Thus the singular values of Mat{Fo), 
and in particular the rank of Mat{Fo), are gauge- and basis-independent. We shall 
frequently be thinking of curvatures as 3 x 3 matrices in this way. When the context 
is clear, we will omit the explicit function ^^Mat" . 

Now consider a standard k = 1 instanton on R^ of scale size A and center y, 
viewed in a radial gauge that is singular at y and regular at cxd. There are many such 
gauges, parametrized by a gluing angle m G 5*0(3). For fixed {y,\,m), let Fgtd be 
the curvature of this connection restricted to U. 

Let A be an ASD connection obtained by gluing in a bubble with data {y, A, m) 
to the background Aq. In §3 we shall see that Fa, in an appropriate gauge, is approx- 
imately equal to Fq + Fstd- This reduces our main question to the following model 
problem: 

When L is small, for what values of {y, A, m), with A small, is Fq + Fstd reducible 
at both p and q ? 

Of course, to obtain sensible answers, we must define what we mean by A being 
"small". Pick constants K > and a G (0,2). We say A is small (or that the 
corresponding bubble is small) if A < KL°'. The set of gluing data for small bubbles 
near p and q is B = U x (0,KL°') x 5*0(3). Let Up (resp. z>g) be the set of points 
{\,y,m) G B such that Fq{p) + Fstd{p) (resp. -Fo(g) + Fstd{q)) is reducible. We must 
count the intersection points of Up and Uq. 

Recall that the singular values ai > a2 > c^ > oi a 3 x 3 real matrix M are 
the square roots of the eigenvalues of M'^M. For M generic, these are distinct and 
positive. The non-generic cases are as follows: Matrices in a codimension-1 set have 
(73 = 0. Matrices in a codimension-2 set either have ai = a2 or (T2 = cr^. Matrices 
in a codimension-4 set have o"2 = (Ts = 0; these matrices have rank 1 or 0. Matrices 
in a codimension-5 set have cti = o"2 = (T3; these are all scalar multiples of 5*0(3) 
matrices. Only the zero matrix (co dimension- 9) has 0-1=0-2 = 0-3 = 0. 

Theorem 2.1 Fix K > 0, a E (0,2), and a background connection Aq. If the 
singular values o/Mat(Fo(0)) are all distinct, then, for all sufficiently small L, Up 
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and Vq intersect at exactly six points. These six intersections are all transverse, and 
the local intersection number is +1 at each point. 

Remark: We shall see that, under the assumptions of the theorem, the intersection 
points all have A = O(L^). However, when two of the singular values of Mat(Fo(0)) 
are the same, then there are only four intersection points with A = O(L^). In that 
case there are generically four additional intersection points with A = 0{L). The 
intersection number of i)p and Ug is then 4 if a > 1 and 8 if a < 1. 

Before beginning the proof of Theorem 2.1 we need some basic facts about k = 
1 instantons on R*^ = H, we need to fix some conventions, and we need a linear 
algebra lemma. Think of SU{2) as the unit quaternions, with 5u(2) as the imaginary 
quaternions. The connection form of a standard instanton of scale size 1, centered at 
the origin, is Agtdo = Im{xdx/{1 + |a:p)). The curvature of this connection is 

dxdx 2iuJi + 2iuj2 + 2kuj^ 

^*'" = (l + |a:|2)2 = (l + |x|2)2 • ^^-^^ 

Note that the matrix Mat(-Fstdo) is 1/(1 + \^\^Y times the identity matrix. 

That is in the usual regular gauge, in which A ~ (p^^dcj) as \x\ -^ oo, where 
0(a;) = a;/|x|. We do a gauge transformation by 0""*^, to get a radial gauge in which 
A = 0{\x\~^) as I a; I — > cxo (and in which A is singular at the origin). We then 
do a further gauge transformation by a constant go to get the most general radial 
gauge with this property. Let Fstd be the curvature form in this gauge. We have 
Fstd = 5'o"V^stdo0"^fi'o- In terms of matrices, Mat{Fstd) = p{gQ^)p{(t))Mat{F^tdo), 
where p is the standard double covering map from SU{2) to S'0(3); the three columns 
of p(0) are (f)i(f)~^, (j)j(j)~^, and (f)k(f)~^. The matrix p{go) is our gluing angle m. 

Now suppose that we have a. k = 1 instanton, centered at a point y, with scale 
size A. The curvature matrix, expressed in the exterior radial gauge of gluing angle 
m, is 

Mat{F,,,{x)) = J"' m-'p f f^) • (2.2) 

(A^ + |a; — ypj^ \\x — y\J 

Note that the matrix Mat{Fstd{x)) is a positive multiple of an 5*0(3) matrix. The 
multiple is determined by A and \x — y\, while the 5*0(3) matrix is determined by m 
and {x — y)/\x — y\. (We hencefore will not explicitly distinguish between a curvature 
and its matrix.) 

Our problem is thus one of adding positive multiples of 5*0(3) matrices to Fq{p) 
and Fo{q) to make them reducible. The following lemma is essential. 

Lemma 2.2 Let P be a 3 by 3 real matrix with singular values (Ti > (T2 > o's > 0. // 

these singular values are all distinct, then there are exactly two pairs (s, M) G (0, oo) x 
5*0(3) for which P + sM has rank 1 (and no pairs (s, M) for which P + sM = 0). 
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In both cases s = (72 (-P)- If exactly two of the singular values of P are the same and 
nonzero, then the two solutions (s, M) coalesce to a double root. 

Proof: Let W = — (P + sM). Adding sM to P to make it reducible is the same 
as decomposing — P as sM + W, with W reducible. We therefore count the ways 
to decompose a matrix — P into the sum of a positive multiple of an 5*0 (3) matrix 
and a rank 1 matrix. First we show that the desired decompositions can occur only 
with s = (72, by assuming a decomposition — P = sM + W and computing o'2(P)- 
Multiplying P on the left and right by 5*0(3) matrices does not change the singular 
values, but does allow us to set M = / and put W into the form 

a b 0\ 
W = \ . (2.3) 

0/ 

{s + a)b 0\ 
P^P = I (s + a)6 s2 + 62 . (2.4) 

sV 

One of the eigenvalues of P^P is obviously s"^, with eigenvector (0, 0, 1). Restricting 
to the upper left 2 by 2 block, we subtract s"^! and get a matrix whose determinant, 
—8%'^, is nonpositive. Thus at most one eigenvalue of P^ P is greater than s^ and 
at most one eigenvalue is less than s^. Since s^ jg the middle eigenvalue of P^P, 
a2{P) = s. 

Next we show that P + sM can have rank 1, with s = (72 (P), in two ways. By 
multiplying on the left and right by 5*0(3) matrices, we can take P diagonal, with 
entries Pn > P22 > l-Psal- Next we look for orthogonal matrices of the form 

/-cos(^) sin(^)\ 
Me= \ -1 . (2.5) 



Then 





We then have 

/Pn-P22Cos(^) P22sin(^) 

P + sMg = P + P22Me = I . (2.6) 

\ P22sin(^) P33 + P22 cos(^) 

This matrix has an obvious null vector (0, 1,0). P + sMq has rank one (or zero) if, 
and only if, there is a second null vector. To see if there is a second null vector, we 
restrict P + sMg to the 1-3 plane and take its determinant, which equals — P2^2 + 
P11P33 + (Pii — P33)P22 cos(^). This is a periodic function of 6 with a single maximum 
of (Pii - P22)(P22 + P33) at ^ = and a single minimum of -(Pn + P22)(P22 - P33) 
at ^ = TT. If Pii > P22 > IP33I, the maximum and minimum values have opposite 
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signs, so the function must cross zero exactly twice, at the points 9 = ±cos ^([^2^2 ~ 
-Pii-P33]/(-Pii - -P33)-P22)- If -Pii = P22 or P33 = -P22, then the maximum value 
becomes zero, while if P22 = -P33 then the minimum becomes zero. In these cases 
we have a double root at 6' = or tt. Finally, if Pu = P22 = P33, then the function 
is identically zero and we have an infinite number of roots. This corresponds to the 
original P being a positive multiple of an 5*0(3) matrix. 

Finally, we show that these are the only possible decompositions with s = P22- 
Suppose that M is an 5*0(3) matrix with P + sM having rank 1. Then every 2 by 2 
block of P + sM has determinant 0, and in particular the upper left 2 by 2 block has 
a null vector v. However, Pu and P22 are both at least s, so \Pv\ > s. The upper 
left corner of sM has operator norm at most s, so |sMf | < s. Thus we must have 
\Pv\ = s\Mv\ = s = P22. If Pii > P22 this means v = (0, 1,0), so Mv = (0, -1,0), 
so M must take the form (2.5). The case Pu = P22 must be checked separately, but 
leads only to the solution M = diag{—l, —1,1). | 



The form of the explicit solutions found above also demonstrates the continuous 
dependence of M on P. Expressed invariantly, M is a rotation by tt about an axis. 
This axis is orthogonal to the second principal axis of P^P, and makes an angle 
6/2 = (±1/2) cos~^([o-| ± o"iO'3]/[(o'i ± o"3)o'2]) with the third principal axis of P^P, 
where the ± is determined by the sign of the determinant of P. A small change in P 
can only change 6 by an amount of order |(5P|/ min(o"i — o"2, o"2 — as), and, by first order 
perturbation theory (integrated to get rigorous bounds), can only change the principal 
axes of P^P by a similar amount. Thus if 6P is a small perturbation of P, the norm 
of the corresponding 6M is bounded by a constant times |5P|/ min(o'i — 0-2,0-2 — cr^). 

Not surprisingly, this stability breaks down when we approach the double root. 
If two of the singular values are equal, then a small perturbation may change M by 
as much as 0(a/|5P|). 

Proof of Theorem 2.1: Let Sp be the second singular value of Fq{p), and let 
Mp G 5*0(3) be a matrix such that Fq{p) + SpMp is reducible (with similar definitions 
for Sg and Mg). Let Sq be the second singular value of Fq{0). Note that Sq > 0, since 
the three singular values of Po(0) were assumed distinct. Since Sp and Sg are within 
0{L) of So, we can bound Sp and Sq away from zero. 

We shall count the ways to simultaneously make Pstd(p) = SpMp and Fstdig) = 
SgMg. The condition for the standard curvature Pstd to have magnitude Sp at p is 

^' .„ (2.7) 



(|t/-p|2 + A2)2 

or equivalently 



>? + \y-p\^ = \/^p. {2.i 
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As long as \y—p\ < l/2ys^ there are two solutions to (2.8), while for \y—p\ > l/2y^ 
there are none. When ||/— p| < l/2y/s^, one solution has A > l/2y^, which is greater 
than KL'^ for L small. The other solution qualifies as small if \y — p\ is small enough, 
and, for \y — p\ <^ l/y^, is approximately \ = \y — ppy^- As a set in R^ = (A^, A) 
space, the solutions to (2.8) are a 4-sphere. Projected onto A^, they form (2 copies 
of) a 4-disk. In either case, only a small subset of solutions qualifies as "small" . 

The interesting question, of course, is how many times we can solve the equations 
for p and q simultaneously. We begin with equation (2.8) and the corresponding 
equation for q. The intersection of two 4-spheres in R^ is a 3-sphere. Projected onto 
A^ we get a 3-dimensional ellipsoid, possible degenerating to two disks. As before, 
only a small patch of the ellipsoid (or alternatively part of one of the two disks) gives 
a small enough value of A. It is this region that we consider. 

Recall that p and q are at ±L, where we are using quaternionic coordinates. 



'S 



1- 



For L small, Sq = Sp + 0{L). Let Sm be such that 2/^/s^ = 1/y^ + l/y 
Let A = (1/y^— l/y^)/L. As L ^ 0, Sm, = So + 0{L'^), while A approaches 
— {dSp/dL)\L=Q/sQ . Let yo and yj be the real and imaginary parts of y. Adding and 
subtracting (2.8) to the corresponding equation for q we obtain 

-4yo = \A; A' + L2 + i/o'+b/r = A/v^. (2.9) 

Plugging the first equation into the second we get 

^'(^ + ^)-^ + L' + \yi\' = (2.10) 



16 7 

This equation shows that A, and thus yo, may be viewed as functions of yj. As 
long as L^ + \yi\^ <^ 1/v^s^ there are two solutions to (2.10), one of which has A ~ 
(L^ + ||//P)a/s^, the other of which has A ~ ((1 + A'^/16)^/s^l)~^. The first solution 
has A < KL'^ if and only if \yj\ is small enough, while the second solution always has 
A > KL°'. Let RK,a be the largest number such that ||//| < RK,a implies A < KL°'. 
Henceforth we consider only "admissible" y, i.e. those with \yi\ < Rx^a- For L chosen 
small enough, as we assume henceforth it is, R^a ~ KL'^/^/s^ — L"^ r^ K L^ / ^/s^, 
since a <2. Note that 

|2 



I/O = -AA/4 ^ -{L' + ||/,Hv/i;;A/4. (2.11) 

Hence for admissible y, we have \yi\ < constL"/^ and |t/o| < const L". Let r = 
(?/o(0),0) be the unique admissible point where the ellipsoid of solutions {yo,yi) to 
(2.9) hits the real axis. Since \r\ = 0{L'^), r lies on the line segment pq, and the 
ellipsoid has curvature 0(1) at r. See Figure 1. 
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Figure 1 

We still have to get the 5*0(3) matrices right. This means simultaneously solving 
the equations rn~^p{{y—p)/\y—p\) = Mp and m~^p{{y — q)/\y — q\) = Mg for m. If a 
solution exists it is obviously unique. A solution exists if and only if p{{y—p)/\y — 
P\)~^p{{y-(l)/\y-(l\) = M-^Mq. Let g{y) = {y -p){y -q) /\{y -p){y -q)\. We must 
count the points on our 3-disk (of small solutions to (2.9) and (2.10)) for which the 
S0{3)-valued function p{g{y)) equals M~^Mg. Note that 



.yi 



g{y) = -I + 2i^(l + Oii\yo\/Ly)) + 0((|y,|/L)^) for \yj\ « L, 



L 



(2.12) 



while 



Ly 



g{y) = I + 2^(1 + 0{{\yo\/\yi\r)) + 0((L/|y,|)^) for \y,\ » L. (2.13) 



\yi\ 



\yi\r) in (2.13) 



In view of (2.11), we can replace 0{{\yo\/Ly) in (2.12) and 0{{\yo\ 
by 0{L'^) and (9(L^"), respectively. 

Observe that L/Rx^a is 0(L^^°/^) and hence goes to zero as L ^ 0. On the disk 
of admissible yj, the map g covers all of SU{2) except for a ball of radius cL^~"/^ 
around the identity, for some constant c. Since p is a 2-1 map, p{g{y)) hits all of 
SO (3) twice, except for a ball of radius 2cL^~"/^ + 0{L'^~") around the identity, 
which is only hit once. The number of solutions to our problem depends on whether, 
for small L, M~^Mq is in this ball or not. 

If the singular values of -Fo(O) are distinct, then, by Lemma 2.2, there are two 
distinct matrices Mi^2(0) for which -Fo(O) + cr2(0)M has rank 1. By the comment 
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after the proof of Lemma 2.2, the two matrices for p and q satisfy Mi^2{p,(l) = 
Mi,2(0) + 0{L). As L ^ 0, Mi{p)-^M2{q) and M2{p)-^Mi{q) are bounded away 
from the identity, but Mi(p)~^Mi(g) and M2{p)~'^ M2{q) are within 0{L) (and hence 
within 2cL^~"'^ + 0(L^~")) of the identity. Thus we have two configurations in 
{y, \,m) space that give SpMi{p) at p and SqM2{q) at q, two that give SpM2{p) at p 
and SgMi(g) at g, one that gives SpMi{p) at p and SgMi(g) at g and one that gives 
SpM2{p) at p and SqM2{q) at g. A total of six solutions in all. 

On a codimension-2 set of background data, the background curvature at the 
origin has two equal singular values, so Mi(0) = M2(0) and Mi^2{p,q) = ^i(O) + 
OiL^/"^). In that case all 4 possibilities have M'^Mg = 1 + OiL^/"^). If a > 1, this is 
within 2cL^~"' ^ of the identity for small enough L, and so each possibility yields one 
solution. If q; < 1, and the 0{L^^'^) term in the expansion of Mi^2{p,q) in powers of 
L is nonzero, then each possibility yields two solutions. 

Finally we consider the orientations of our solutions. It is not immediately clear 
that all solutions have the same orientation, but in fact they do. The problem of 
matching amplitudes is the same in all cases. The problem of matching gluing angles 
reduces to the intersection of two 3-cycles in a 3-disk x 5*0(3) (i.e., all possible pairs 
{yi, m)), and is easily seen to be transverse. The intersection numbers are continuous 
functions of Mp and Mg, as long as a solution continues to exist. Sending Mp around a 
noncontractible loop in SO {3) interchanges the two solutions associated to a given pair 
{Mp, Mq), which shows that the two solutions for any given {Mp, Mq) have the same 
orientation. Also by continuity, this orientation is the same for all pairs {Mp, Mq). 

All that remains is to compute this orientation in one case. Let Sp = Sq = 1, 
Mp = Mq = I, and look near the solution with y = and m = I. The varieties Vp 
and i)q are just the zero sets of -Fstd(p) ~ I and -Fstd(Q') "~ I) which we view as functions 
of {y,\,m). Taking derivatives we find that changes in {y,X,m) give the following 
first order changes in -Fstd(p) and Fstd{q)'- 

1. Increasing A increases the magnitude of both -Fstd(p) and Fstd{q) without chang- 
ing either direction. 

2. Increasing yo increases the magnitude of Fstd{p) and decreases that of Fstd{q), 
while keeping the directions fixed. 

3. Increasing yi (resp. t/2, ys) rotates Fstd{p) in the direction defined by the Lie 
algebra element —i (resp. — j, —k), and rotates Fstd{p) an equal amount in the 
direction +i (resp. +j, +k). 

4. Rotating m in any direction rotates both Fstd{p) and Fstd{q) in the opposite 
direction. 



From this we deduce that the Jacobian \d{Fstd{p),Fstd{q))/d{y, X,!!!,)] is positive, 
and so that the local intersection number of Up and Uq is +1 in this case. Thus the 
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local intersection number of Up and Uq is +1 in all cases. 



Having proven Theorem 2.1, we consider the question of stability. How much do 
our intersection points move around if we change Mp or Mg or Sp or Sg slightly? Since 
-fo + -^std is only an approximation to the true curvature of a connection in A4k+i, 
our results must be stable in order to be meaningful. 

Let X be the map that takes {y,\,m) to (Fstd(p), -f^std(5'))- Near our solutions, 
dx is never close to singular. By changing A and one component of y we can adjust 
|-f^std(p)| and |Fstd(5')| independently, while by adjusting m and the remaining three 
components of y we can adjust the directions of -Fstd(p) and Fstd(?) independently. It 
is not difficult to estimate the matrix elements of {dx)~^- Some are 0(1), some are 
0{L), and some are 0{L?'). If we know the required -Fstd(p or q) to within e, we know 
m to within 0(e), y to within 0{eL), and A to within 0{eL'^). In short, small errors in 
the input data result in only small changes of the locations of our intersection points 
in (y, A,m) space. 

Finally, we consider a perturbation of our model problem that is more directly 
applicable in the sequel. Let Fq{x) be the curvature of the background connection 
in the standard radial gauge about the gluing point y. (That is, use the original 
connection Aq to trivialize the fiber over y, and then use parallel transport radially 
outwards from y to trivialize the bundle over U .) We wish to count the number of 
ways to make Fq + Fgtd reducible at both p and q. 

Theorem 2.3 Under the assumptions of Theorem 2.1, the number of ways to make 
Fq + Fstd reducible at p and q (counted with sign) is the same as the number of ways 
to make Fq + Fstd reducible at p and q (counted with sign), namely +6. 

Proof: We first put our background connection into a radial gauge with respect to 
the origin. This is a fixed gauge, and Theorem 2.1 applies. Since Fq and Fq are related 
by a gauge transformation, the singular values of Fq and Fq are the same. Thus we 
must solve (2.8), and the corresponding equation for g, exactly as in Theorem 2.1, 
with the same values of Sp and Sg. We then solve m~^ p{{ii — p)/\y — p\) = Mp and 
"iT^^^ p{iy~(l) /\y~(l\) = ^q for m, as before. The only difference in our analysis is that 
Mp and Mg are now functions of y. We compute the extent to which they depend on 

y- 

Let A be a connection in radial gauge with respect to y, and let A' be the same 
connection in radial gauge with respect to y'. The gauge transformation that relates 
these, evaluated at the point p, is the holonomy around a triangle from p to y to y' 
to p, and so its difference from the identity is bounded by the sup norm of I-F4I times 
the area of the triangle. See Figure 2. 
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Admissible solutions 

(yo,y.) to (2.9) 




Figure 2 



In our case, A is tlie background connection, so \Fa\ is fixed and bounded, and y 
and y' are restricted to lie on the ellipsoid of solutions to (2.9) and (2.10), with \yi\ 
and \y'j\ both less than RK,a- Note that the area of a triangle is bounded by half the 
product of the length of any two of its legs. Because the curvature of the ellipsoid of 
solutions y = {yo,yi) is 0(1) at admissible points, \yo — y'^l is bounded by a constant 
times \yi — y'j\. As a result. 



\Mp{y) - Mp{y')\ < const x ^/iX 



\y\'^\yi 



y'i\ 



while 



\Mp{y) - Mp(0)| < const x L\yi\ 



(2.14) 



(2.15) 



with similar estimates for M„ 



The second result is an estimate on Mp itself, while 



the first leads to a bound on the derivative of M with respect to yi. 
L^ + |yP < const {L^ + |y/P), so we obtain 



\dMp/dyi\ < const x ^/IXT 



yi\ 



By (2.11) 



(2.16) 



As before, we look for solutions to p{g{y)) = M~^Mq, where now the right hand 
side depends on y. We break the disk of radius RK,a = 0(L"'^) into two pieces, a inner 
disk Di and an annulus D2. The radii will be chosen such that on Di the estimate 
(2.14) is strong enough to allow implicit function theorem arguments to apply. Here 
the solutions to p{g{y)) = M~^{y)Mg{y) are but small perturbations of the solutions 
to p{g{y)) = Mp^{0)Mq{0). On D2 the estimate (2.15) will be strong enough to show 
that there are no solutions to p{g{y)) = M~^{y)Mq{y). Taken together, this will 
prove the theorem. 

On the disk Di, the implicit function theorem will apply as long as the small- 
est singular value of d{p o g)/dyi is at least twice the largest singular value of 



1^ 



d{Mp^Mg))/dyi, which by (2.16) is bounded above by a multiple of (L^ + |y/P)^^^. 
Computing the derivative of p o 51 is an easy geometrical calculation, and one finds 
that all singular values are bounded below by a constant times L/{L'^ + |y/P). Com- 
paring L/^LF' + ||/7p) to (L^ + \yi\'^Y''^ we see that the implicit function theorem 
applies whenever \yj\ is smaller than a constant times L^'^, and in particular when- 
ever \yi\ < L^/^ (and L is sufficiently small). We take the radius of Di (and the inner 
radius of D2) to be L^/^. 

Now consider yi E D2- If a > 1, then D2 is empty, so we assume a < \. By 
(2.13), \I-p{g{y))\ = 2L/|t/,|(l + 0(L")) + 0(LV||/,|2). Since cL^' < \yj\ < c^L^I^ 
c^L^-'^l'^ <\I - p{g{y))\ < caL^''^. Now recall that M-i(0)Mg(0) is either bounded 
away from the identity or is within 0{L) of the identity (e.g. Mf ^(p)M2(g) is bounded 
away from the identity while Mi^{p)Mi[q) is within 0[L) of the identity). By (2.15), 
M~^{y)Mq{y) is also either bounded away from the identity or within 0{L) of the 
identity on D2. Thus |/ — M~^(y)Mq{y)\ can never be between csL^""/^ and c^L^^'^, 
so there are no solutions to p{g{y)) = M~^{y)Mq{y) on D2. | 



3 The Perturbed Moduli Space 

In this section we show that the model problem of §2 correctly describes the intersec- 
tion of Up, Uq, and a perturbation (denoted Aik+i) of the boundary region of A4k+i- 
Aik+i is parametrized by quadruples {AQ,y, X,m), where Aq G M.k is a background 
connection, and the glued-in bubble has size A, center y and gluing angle m. We 
construct Aik+i by an explicit gluing formula and show that, in the relevant region, 
the curvature of a connection in Aik+i is well approximated by the sum of the back- 
ground curvature Fq and the curvature Fstd of a standard instanton of size A, center y 
and gluing angle m. Our model problem was essentially to make this sum reducible at 
p and q. By interpolating between this sum and the actual curvature of a connection 
in Aik+i, we show that the results of §2 carry over almost word for word. 

As before, we pick a background connection Aq E Aik and constants K > and 
a G (0,2). Let the neighborhood U in A^, and the corresponding neighborhood U of 
the origin in R^, be as in §2. We now allow bubbles to be glued in anywhere (not 
just in U), so the set B of gluing data is a {0,KL°') x 5*0(3) bundle over A^, with 
local coordinates {y, A, m) G A^ x (0, KL*^) x 5*0(3). When the center of the bubble is 
in f/, we identify the center point in A^ with the corresponding coordinate in f/, and 
call both points y. For each (y, A,m) G -B, let F be the curvature of the connection 
(Ao,!/, A,m) G A^fc+i- The variety Up (resp. Vq), restricted to the fiber of ^A.k+l over 
Aq, is the set of points {\,y,m) G B such that F~{p) (resp. F~{q)) is reducible. We 
must count the intersection points of Up and Uq. In this section we prove 
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Theorem 3.1 Fix i^ > 0, a G (0, 2), and Aq G Aik- If the singular values of Fao{^) 
are all distinct, then, for all sufficiently small L, the intersection number of Vp, Vq, 
and the fiber of Ai^+i over Aq is +6. 

We begin by constructing the space Alj._|_i. For now, assume we are gluing a 
bubble of size A in f/, with the center point at the origin. There are three natural 
length scales determined by the background connection Aq. The first is the length 
scale |F4q(0)|~^/^ of the background curvature at the origin. The second is the length 
scale |F4„(0)|/|V^FyiQ(0)| at which this curvature varies. Let R^ be the smaller of 
these two length scales. Finally, let Sq be the second singular value of F4„(0). It is 
easy to see that sq < V-^i) but there is no simple lower bound for sq (although, by 
assumption, sq is always positive). As we have seen, sq measures how far -Fao(O) is 
from being reducible. 

Now pick additional length scales Ri and R2, which can depend on A, -R3 and Sq; 
such that R\ < lO^^A-Rs and R2 > lO^A/^/s^. When A <^ R3, which is the only case 
we will consider, we want X <^ Ri <^ R2 '^ R3. The points of interest x will all have 
i?i < |a;| < i?2- The number 10^ is of course arbitrary. It is just chosen large enough 
that we can safely ignore small numerical factors. 

Let /3(r) be a smooth monotonic function that equals zero for r < 1/2 and equals 
1 for r > 2, and such that P' < 1. We define cutoff functions Pi{x) = P{\x\/Ri) and 

P2{X) = 1-P{\X\/R2). 

Let Aq be the background connection expressed in a smooth fixed radial gauge 
with respect to the origin. Let Agtd be the connection of a standard instanton of 
size A expressed in a radial gauge that is singular at the origin and regular at cxd. 
(This gauge is not unique; it depends on a gluing angle m. See the discussion before 
expression (2.2)). Note that \Astd\ ~ A^/r^ for r :^ A while l^ol ~ ''"I-^AoI = ''"/-R3 for 
r < i?3. 

Our point {Aq, 0, A, m) G M.k+i is defined by the connection form 

A' = /3,AQ + P2A,td- (3.1) 

We compute 

F = FA'=dA' + A'AA' = (3iFa,+(32Fa^,, 

+ {Pi - /3i)Ao A Ao + {(3l - (32) A,,^ A Astd 

+ rf/?i A Ao + rf/32 A Atd 

+ /3i/?2(^tdAAo + AoAAstd), (3.2) 

and the interpolating 2-form 

Fi = t(FAo + Fstd) + (l-t)F, (3.3) 
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where < t < 1. 

If the bubble is to be glued in at a point y, rather than at the origin, we must 
adjust the formulas as follows. First suppose y E U. Take Aq as the connection 
of the background in radial gauge with respect to y (not with respect to 0). The 
quantities sq, Ri, R2, and R3 are computed from the curvature FA^iy), not Fao(O). 
The connection Agt^ is in a singular radial gauge with respect to y (not to 0). The 
cutoff functions are /9i (x) = P{\x — y\/Ri) and/92(a;) = 1 — P{\x — y\/R2). With these 
modifications, we still have A' = PiAq + /32Astdj and formulas (3.2) and (3.3) still 
apply. For y ^ U, just apply the same formulas, using geodesic normal coordinates 
around y. In this case the "standard instanton" Agtd is no longer exactly anti-self- 
dual, but becomes anti-self-dual in the A — * limit. The gluing angle m depends on 
a local trivialization, but the set of gluing angles is invariant. This defines the space 
Aik+i for all y. 

In §2 we distinguished notationally between radial gauge with respect to and 
radial gauge with respect to y, calling the background curvature Fq in the first case 
and Fq in the second case. Theorem 2.1 discussed making -Fq + Fstd reducible at p and 
q, while Theorem 2.3 discussed making Fq + Fstd reducible at p and q. In this section 
the background connection is always in radial gauge with respect to the gluing point 
y. With only one case to consider, we always write Fq, never Fq. 

Note that we do not use the gluing formula found in standards works such as 
[DK]. Traditionally, one takes A" = (1 — /?2)^o + (1 ~ l3i)Astd, so that the resulting 
connection is exactly flat in the annulus with radii 2Ri and -R2/2 around y. This makes 
identifying the bundles on which Aq and A^i^ live conceptually easier. However, such 
a procedure makes for a perturbed moduli space on which Up and Ug intersect non- 
transverely, since F^„ is reducible, indeed zero, on the entire annulus 2i?i < r < -R2/2. 
This makes the intersection number effectively impossible to compute. 

Instead, we allow the supports of (3iAq and /92^std to overlap, as in Taubes' 
work such as [T]. This allows us to observe the interaction between the background 
connection and the glued-in instanton. In the [DK] method, the interaction only 
occurs when we go from our explicit approximate ASD connection to the true ASD 
connection (something we have relatively little analytic control over). In our method, 
the interaction is seen at the level of the approximate connection A', which we can 
calculate. Moreover, F^, is much smaller than F^„ (in the L^ norm), so our method 
should give a closer approximation to the properties of the true moduli space. 

Let ut^p (resp. ut^q) be the set of gluing data {y, A, m) with A < KL" for which 
Ff{p) (resp. Ff{q)) is reducible. If y is not in U, then, for small enough A, the 
connection form near p is exactly Aq. By assumption, Fq is not reducible at the 
origin. For small enough L, therefore, Fq is not reducible at p, and Ft{p) = Fq{p) is 
not reducible. We may therefore assume, without loss of generality, that our gluing 
point y is always in U. Indeed, by picking L small enough we may assume that y is in 



21 



an arbitrarily small neighborhood of the origin, and therefore that -Fo(y) is arbitrarily 
close to -Fo(O)- Thus we may take the length scales Ri, R2, and -R3 to be independent 
of y (although Ri and R2 may depend on A) . 

We consider five possibilities: 

1. 1. \p — y\ < Ri/2, {p is in the "interior zone", where /9i = and (32 = 1), 

2. II. Ri/2 < \p — y\ < 2Ri, {p is in the interior "shoulder"), 

3. 111. 2i?i <\p-y\< R2/2, {p is in the "plateau", where (3^ = (32 = 1), 

4. IV. -R2/2 < \p ~ y\ < 2R2, {p is in the exterior "shoulder"), and 

5. V. \p — y\ > 2i?2, {p is in the "exterior zone", where (3i = 1 and (32 = 0). 

As in §2, we will be identifying curvatures with 3x3 real matrices. The phrase 
"the second singular value of F" , for example, is shorthand for "the second singular 
value of Mat(F-)". 

The problem of Theorem 2.3 was to find ui^p, vi^q and count their intersection 
points. In that problem condition III always applied, with \p~y\'^ ~ X/^/s^. We will 
show that F^ip) being reducible with A < KL"" also implies condition III, and that 
Vt,p is a small perturbations of vi^p. We establish condition III by showing that the 
other conditions lead to contradictions. 

We begin by considering condition I. Where |a; — 1/| < -Ri/2, A = A^t^ has ASD 
curvature, so Ff = Ft = tFA^■,+FstA■ For Ft to be reducible at p we need |Fstd(p) I = ^Sp- 
That is, A^ + |p — yp = \/ ^Jts^. This quadratic equation has two solutions, one with 
A K, \p—y\'^^Jts^, the other with A ^ 1/^/ts^, but neither is consistent with condition I. 
Since \p—y\ < R3, Sp is close to Sq. Since \p—y\^ < Rf < 10~®Ai?3, while ^/tso < I/R3, 
one cannot have A ~ |p — y\'^^/ts^. The second solution has A ~ 1/y/ts^ > R3, which 
contradicts A <^ R^. Thus condition I is impossible. 

If p is in the interior shoulder we have additional terms to consider: 

Ft = F,td+it+il-t)(3i)FA,M'^-t)[i(3^-(3i)AoAAo+d(3iAAo+(3iiA,tdAAo+AoAA,td)]- 

(3.4) 
The ASD part of the terms after Fgtd can be bounded in norm by I/-R3 + 4i?i/i?| + 
A/Rl + \^/R\Rl < 100/i?3, and so the second singular value of Ft is within 100/i?3 
of the second singular value of -Fgtd- For FfT to be reducible, |-Fstd| can be at most 
100/i?|. Thus we need A/(A^ + \p — yP) < IO/-R3, which in turn means that either 
A > -R3/IOO or A < 100i?^/i?3. The first is not allowed, as A is assumed small. The 
second contradicts the definition of Ri. So condition II is also impossible. 

If p is in the exterior zone, we have Ft = F^^ + (1 ~t)Fstd, so we need A/(A^ + \p — 
yP) = a/sp/(1 — t), or equivalently A = (A^ + |p — y\'^)^Jsp/{l — t). But \p — yp > 
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2Rl > lO^A/y^, so iy/sp/(l — t) always exceeds A/(A^ + \p — yP)- So again we have 
a contradiction. 

If p is in the exterior shoulder, we have 

Ft= {t+{l-t)P2)Fstd+FAo + {^-t)[{Pl-p2)A,tdAA,,^+dp2AA,td+p2{AstdAAo+AoAA,td)]. 

(3.5) 
The ASD parts of the terms other than Faq have total magnitude bounded by A^/i?2 + 
XyRi + \^/Ri + \^/RlRl < 10AVi?t < lO-i^so < lO'i^Sp. But Fa, is a distance 
greater than Sp/2 from the nearest reducible matrix, so Ff{p) cannot be reducible. 

Thus for all points in Ut^p condition III applies, and here the analysis is relatively 
simple. The cutoff functions are both 1, so F{p) = F4„ + Fstd + (1 — t)(Astd AAq + Aq A 
Astd)- This last term has magnitude bounded by \^ / R\R\^ and changes only slightly 
as (y, X^rn) are varied. It can thus be treated as a perturbation of Faq- We perturb 
z/i^p to Vt,p iteratively (as in the standard proof of the inverse function theorem): Given 
a point in z/i^p, compute (1 — t)(a A Aq + Aq A a), use that to adjust (y. A, m), compute 
the change in (1 — t)(Astd A Aq + Aq A a), adjust (y. A, m), and so on. The iteration 
converges geometrically. Similarly, a point in Vt,p can be perturbed to a point in v\^p. 

Of course, the same analysis applies to ut^q. 

Now we consider the number of intersection points of vt,p and vt,q-, as a function of 
t. The only way the intersection number can change is if intersection points appear or 
disappeared at the ends of vt,p or vt,q- However, we have shown that such intersection 
points can only occur when both p and q are in the plateau. In the proof of Theorem 
2.3 we saw that, for X ^ L^ but A ^ 1 (e.g., A ~ KL"), the points of z/i p are bounded 
away from uig. Since condition III applies, for A ~ KL°', Vt,p and Vt,q are close to z/ip 
and z/i,g, respectively, and so are bounded away from each other. Thus intersection 
points between Vt,p and Vt,q may not appear from or disappear to the boundary. Thus 
i^{^o,p n i'o,q) = #(^i,p n ^i,q)- By Theorem 2.3, the latter number is +6, regardless 
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4 Computing the Donaldson Invariants 

In sections 2 and 3 we saw that, for a fixed generic background connection, there are 
six ways to glue in a small bubble near p and q so as to make the curvature reducible at 
p and q. In this section we demonstrate that this is sufficient information to compute 
the contribution of the boundary region of Aik+i to the simple type condition. For 
generic choices of representatives (of the classes other than /i(p) and fi{q)), and for 
generic choice of the location of the origin of our coordinate system, the boundary 
region contributes 6/64 of what is needed for simple type. 

23 



We continue the notation of §§2-3. M.k+i is the perturbed moduh space and U 
is a fixed ball in A^ with a Euclidean metric, which we identify with a neighborhood, 
U, of the origin in R'^. For fixed K, a, L, let A1°_|_i be the subset of Aik+i with 
A < KL°'. Let a; be a formal product of cycles [Si], . . . , [S„] G H^{X) such that 
deg(/i((x;)) = dim(A^fe), so that the Donaldson invariant D{ijj) is computed on the 
/c-th moduli space M.k- 

We assume that the classes {[Sj]} are represented by smooth submanifolds {Sj} 
in general position. In particular, a subset of the {Sj} can intersect only if their 
codimensions add up to 4 or less. Pick tubular neighborhoods {Sj} of {Sj} small 
enough to have the same property: a subset of the {Sj} can intersect only if the 
codimensions of the corresponding Sj's add up to 4 or less. Similarly, we assume 
that the Sj's do not intersect our fixed ball JJ . Choose a geometric representative 
Vi of each /i([Sj]) that depends only on the connection restricted to Sj. This may 
be done for the 1, 2, and 3-dimensional cycles as in [DK], and for the 0-dimensional 
E's as in [DK] or [SI]. (This allows us to identify the geometric representative of 
/x([S]) on Bk with the geometric representative of /i([S]) on Bk+i- In each case it 
is the set of connections whose restriction to S satisfies a certain condition). Note 
that the codimension of l^ in i3 is the codimension of Sj in A^. Let K; = ^iVi- K; 
is a geometric representative of /i(c<j). Generically, K) will intersect M.k at a finite 
number of points (this number, counted with sign, is the Donaldson invariant D{uj)), 
and each of these points will exhibit generic behavior. In particular, for each such 
point Aq, we can assume that Mat(FAQ(0)) has three distinct singular values. 

Theorem 4.1 Fix U,uj,K > 0, and a G (0,2). For generic choices ofV^^, as de- 
scribed above, and for all sufficiently small L, the intersection number of M.^k+i with 
Vi^nUpHUq is 6D{uj). 

Proof: We need to show that the only way for the boundary region of Aik+i to 
intersect K; H z/p fl Ug is if a bubble is pinching off near p and q, while the background 
connection in A^^ is contributing to D{uj). We then must demonstrate that, under 
these circumstances, the problem reduces to the counting problems studied in §§2-3. 

Suppose we have a small bubble centered at a point y that is not in U. The point y 
can lie in at most 4 of the Si's, with the corresponding Sj's having total codimension 
4 or less. Recall that we are using the explicit formula (3.1), and that outside a 
neighborhood of y the new connection is identical to the background connection. 
For small A, therefore, the bubble inserted at y has no effect on the connection 
restricted to the remaining S's (which we index by j). Therefore, for a connection 
{Aq, \,y,m) G A^fe+i to lie in fljl^, the background connection Aq G A4k must lie in 
r\jVj. However, Aik has dimension 8 less than Aik+i, while f^jVj has dimension at 
most 4 more than DiVi. Since the dimension of Aik is less than the codimension of 
HjVj, rijVj n A^fc is generically empty. 
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Next we consider the case where a small bubble is centered in U. Then {Sj} is 
all the cycles S except the two points p and q. For small A, on each of the S^-'s the 
connection form is equal to the background connection Aq, which must therefore be in 
CijVj CiAik- However, now the dimension of Aik and the codimension of CijVj match. 
r]jVj n Aik is, by our genericity assumption, a discrete set of points, whose number 
(counted with sign) is the Donaldson invariant D{uj). For each of these points, the 
singular values of Mat(F4(,(0)) are distinct. 

By Theorem 3.1, for each such background Aq, and for L small enough, there are 
exactly 6 values of {\,y,m) such that {Aq, \,y,m) G -Ml^i has reducible curvature 
at p and q. Furthermore, the intersection numbers for the local problem are all +1. 
Now the orientation of Ml+i is the same as that of Mk xU x {0, KL°') x 5*0(3) [D3, 
§3]. 

Thus the contribution of points {Aq, \,y,m) to D{[p] ■ [q] ■ u), for fixed Aq, is 
exactly 6 times the contribution of Aq to D{uj). Summing over the finite set {^o}; 
we get that the contribution of M.k+i to D{\p\ • [q] • u) is QD{uj). | 
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5 Differential forms and the /x-map: Introduction 

Theorem 1.1, restated precisely as Theorem 4.1, is one of the two major results of 
this paper. It quantifies the contribution of the boundary region of moduli space to 
the geometric representative computation of the Donaldson invariants that appear in 
the simple type recursion relation. The remainder of the paper is a proof of Theorem 
1.2, which quantifies the contribution of the boundary region to a differential forms 
calculation of the same Donaldson invariants. 

In this section we construct a de Rham-theoretic version of Donaldson's /x-map 
using Chern-Weil theory. Recall that there is a canonical S'0(3)-bundle V -^ B* x N, 
and that the /i-map is defined by slant product with —jPiiV). Using the L^ metric 
one can produce a natural connection on V, with curvature J-'; see [DK §§5.1-2]. By 
the Chern-Weil formula one has 

-7Pi(^) = Atr(^A^) G n\B* X iV), (5.1) 

where the trace comes from the two-dimensional representation of 5o(3) = su(2). Let 
us write tangent vectors to i3* x A^ as pairs {a,X) with a G TB* and X G TN, 
and identify TaB* with ker((rf^)*) C n^{AdP). Further, for a,(3 E n\AdP), define 
{a, P} = —"^i^Qlf^i, Pi] G Q'^{AdP), where the local Ad P-valued functions a^, Pi 
are the components of a,P relative to a local orthonormal basis of T*N. If A is 
irreducible, then JF((q;, 0), (/3, 0)) = — 2G^{q;,/3}, where Gq is the inverse of the 
covariant Laplacian on Q^{AdP), and hence 

/id(a;)(a,/5,7,p)|^ = / (6(p,o)i(7,o)i(/3,o)i(a,o)^^tr(J^AJ^) ) w 



N 



iTC 



\ I tr(G^{a,/?}G^{7,p} + G^{a,7}G^{p,/9} 

^ JN 

+ Gt{a,p}G^{P,^})u. (5.2) 



For our application it is crucial to get the combinatoric factors in (5.2) correct. 

If we replace uo by (5p, a delta-form supported at a point p, the resulting form on 
B* is still de Rham cohomologous to a form obtained using smooth uo. Henceforth we 
write /irf(p) := fJ^di^p)- For any p E N, a 4-form representing fid{p) is given by 



fXd{5p){a,p,-f,p) 
1 



TT" 



tr {G^{a, P}G^{i, p} + G^{a, i}G^{p, P} + G^{a, p}G^{P, 7}) 



.(5.3) 



To make use of (5.3) we need some concrete formulas — with calculable leading 
terms and small remainders — for G^jo;, P}. We can obtain such formulas when A is a 
concentrated instanton with a "charge-one bubble" and a,P come from infinitesimal 
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changes in the bubble parameters (center, scale, and gluing angle). Tangent vectors 
of this type span an "approximate tangent space" on which very strong estimates are 
possible. This space, its relation to the action of the quaternionic affine group on 
R"^, and its relation to the gluing construction in [DK], are central to the proof of 
Theorem 1.2. In the next section, we define the approximate tangent space precisely 
and study these relations in detail. 



6 Group actions and the approximate tangent space. 

Let H denote the quaternions and H* the nonzero quaternions. The eight-dimensional 
approximate tangent space we define later is obtained by an "almost-action" of H* x 
H = R+ X SU{2) X R^ on B, induced by an almost-action on P (what "almost- 
action" means is explained below). Essentially, we lift from N to P cut-off versions 
of translations, dilations, and "self-dual rotations" in a gauge-invariant way. 

To make this more precise, let X be a vector field on A^, and A a connection on 
P. The pair {X, A) defines a flow on P obtained by lifting X A-horizontally to P. 
We thereby obtain from X the "canonical flow of X on A" , with associated vector 
field Ah^ X := lxFa E n^{Ad P) = TaA (see [GPl] Proposition 4.3). The canonical 
flow is invariant under the gauge group, hence descends to B. Moreover any two lifts 
to P of a diffeomorphism of A^ differ by a gauge transformation, and hence given an 
action on A^ by any connected Lie group G on A^, the canonical flow integrates to a 
well-defined action of G on B, though in general not on A. Of interest to us later 
will be the comparison of the canonical lift to that obtained by lifting X horizontally 
with respect to a reference connection Aq. In that case the difference between the 
two tangent vectors in T^A induced by the two flows is d^u, where u = lx{A — Aq). 

Now let G be a Lie group acting from the left on A^. Suppose that for each g E G, 
the action $g oi g on N lifts to a bundle map ^g : P ^ P; ii G is connected we can 
obtain such lifts by using the canonical flow. (We do not require the lifts to piece 
together to a G-action.) For later purposes we will need to calculate the differential 
of the induced action of G on B, at any g E G. This is not difficult, but it is easy to 
confuse the roles of g and g^^ in this calculation, and this mistake would be fatal for 
our application. 

For each connection A E A, let Q^ E Q^{P,su{2)) denote the connection form 
of A. Given a lift (pg as above, define g ■ A to he the connection with connection 
form ($_ ^)*0^. If the lifts piece together into an action of G on P (necessarily a left 
action), then {g,A) -^ g ■ A defines a left action of G on A. $3132 ^ind ^g^ o ^^^ are 
gauge-equivalent, since both are lifts of $3132; so an element-wise liftable G-action on 
A^ always induces a G-action on B, whether or not it induces one on P. 



Now fix [Aq] G B and define p : G ^ i3 by p{g) = [g ■ Aq]. This is well-defined and 
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is independent of the choice of hfts. On a small enough neighborhood U of any g E G 
we can always choose the ^h to vary smoothly with h, so that on U the map p factors 
through a smooth map p : G -^ A defined by p{g) = g ■ Aq. Let v = ■^gt\t=o G TgG 
and write v = Rg^,w, where w G T^G = Q. Then 



d 
It' 



P*g^ = — ((6^P(^^)fi') ■ ^Oy 



(6.1) 



t=o 



But ^cxp{tw)g = lit)°^cxp{tw)°^g for some gauge transformation 7(t) varying smoothly 
in t, and hence {exp{tw)g) ■ Aq = ((exp(tu')) ■ g ■ Aq) ■ 7(t)). Thus 



P*gV = —{exp{tw)-g-Ao) 



mod Im.{dg.Ao] 
t=o 



mod lm.{dg.A^t)■ (6.2) 



t=o 



Let w G r{TN) and wp G r(TP) be the vector fields on N and P induced by the 
infinitesimal action of w. Then 



((*exp{to))*^9-A. 



dt 



-CuipUJg-Ao = -i^wFg.Ao mod lm{dg.Ao) (6-3) 



t=o 



(if $ is defined by the canonical flow then the "mod lm{dg.Ao)" '^^^ t)e erased in this 
line). Note that v directly defines a vector field on N by v\^^(^x) = -^{^gti^))\t=o- 
Since we can take gt = exp{tw)g, it immediately follows that -01$ (^x) = "U^l* (x) for all 
a; G A^, so the vector fields v and w are the same. Hence 

p^gV = -bi,Fg.A^, mod \m{dg.Ao)- (6-4) 

Thus if we identify TyA]B with ker((rf^)*) C Qi{AdP), then 

'P^gX = -T^A^xFg-Ao- (6.5) 

where vr^ : Q^{Ad P) — * ker(((i^)*) is the L^-orthogonal projection. (Here and below, 
for notational convenience we do not distinguish between a tangent vector to B oi Ai 
at [A], literally a gauge-invariant section of a certain vector bundle over the gauge- 
orbit through A, and the representative of this section at A.) 

We would like to apply these ideas to the situation of a local action of H* x H on 
a neighborhood of a point in A^. Given a concentrated connection A, with scale A = 
X{A) and center point pyi, fix a positively oriented normal coordinate system centered 
at Pa- ^ Near pa it makes sense to speak of the translation, dilation, and rotation 

^The precise definitions of A and pA are not important here. There are several definitions in the 
literature, leading to some arbitrariness in the definition of "near" , "bubble" , etc. In all instances in 
which the differences among these definitions have been carefully analyzed, it has been found that 
these differences do not affect the estimates we need in any material way (cf. [GP2], section 5). 
We will simply assume in this paper that the same is true here, and will freely quote results proved 
using different definitions as if they had been proved using consistent definitions of scale and center. 
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vector fields. These are determined invariantly by data (b, a, a) G Tp^M©R©A^(TA^) 
by setting 

X(b,.,.) = b^^ + (v^A-^)(ax^ A + a,^^^^) (6.6) 

where {x*} are normal coordinates centered at pa and tP , aij are the associated com- 
ponents of b, a. We include the normalization factor (v^A)^"*^ to arrange Ilij^-Fyilh ~ 
const (independent of A); see Proposition 6.4 below. We call cnjX^-^ a self- dual /anti- 
self-dual rotation vector field if Uijdx^ A dx^ is a SD/ASD two-form at pa- 

Since the X are only defined locally, we extend them to A^ by cutting them off 
outside a small ball. For this purpose X as above we define X = f3X, where /5 is a 
cutoff of scale 

e = 4noA^/l (6.7) 

Here no is a constant taken large enough to ensure that [3 can be used in the gluing 
constructions of [DK], but for all of our other applications we can ignore Uq. For 
convenience we take /3 = /?std(^yi/e)) where r^ is distance to pa and /3std is a cutoff 
function with support in [0,2], identically 1 on [0,1]. (These cutoffs, which will be 
used for the rest of this paper, are different from the ones in §3.) 

We define 

i)A = {^{b,a,a) = /3X(b,a,a) G T{TN) \ (b, fl, «) G T^^M © R © AliTN)} (6.8) 

It is worthwhile to observe that the definition of (A)SD rotation vector fields is 
necessarily a local definition, since globally a nontrivial exact 2-form cannot be SD or 
ASD on an orientable compact manifold. In fact on S*^, rotations that are SD at one 
pole are ASD at the other. This is most easily seen by using stereographic projection 
to identify S*^ — {oo} with R^, then with H. Left-multiplication by unit quaternions 
induces SD rotations near 0, while right-multiplication induces ASD rotations near 0. 
But coordinates near oo on S*^ are related to those near by quaternionic inversion 
(the orientation-preserving map x \-^ x^^), which interchanges the roles of left- and 
right-multiplication. 

When A is ASD, we make the following definition. 

Definition 6.1 The approximate tangent space Ha at A to the moduli space is the 
space 

{Xa := txFA I X G [)a}- (6.9) 

We usually write simply X and leave the A-dependence implicit. 

To justify this terminology, we consider the action induced by such X on an ASD 
connection. Since the X's are nearly conformal vector fields, one expects the induced 
flow to map an ASD connection to a nearly ASD connection. Proposition 6.4 below 
shows that this is the case, and more-but first we need a definition and a lemma. 
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Definition 6.2 Given /t, z/, Aq > 0, let J^'^+i Aq *- ■^'k+iM denote the subset of 
instantons [A\ obeying the conditions 

(i) the first eigenvalues of the Laplacians {d^)*d^ on zero-forms, d'^{d'^) * on SD 
two-forms, are both greater than z/, 

and 

(ii) for all p e N, 

\FA{p)\<CXyiX' + rA{pf)' + K, (6.10) 

where Fa is the curvature of A, ta^p) = dist{pA,p), and A,^^ are the scale and center 
point of A, respectively. 



The pointwise bound (6.10) essentially says that \Fa\ is bounded by the curvature 
of a standard instanton plus a contribution n from a background connection. At small 
distances from pa, the latter term is negligible, but far from pa the background term 
dominates. 

When dealing with estimates for the approximate tangent space, one must decide 
at what scale e to cut off the vector fields in [)a- If one takes e too small, the 
derivatives of the cutoff function become inconveniently large, while if one takes e 
too large, the contribution from the background connection swamps the contribution 
from the concentrated curvature. If we require that e scale as a power of A, we get the 
optimal balance between these undesirable features only if e ~ A^^^. Earlier we chose 
e = 4noA^'^, and we now take uq large enough so that in the gluing construction 
of [DK] one is assured of landing in the domain of Taubes' contracting mapping 
argument. Once no has been so chosen, it (like k) is simply another ignorable constant 
for the computations we need in this section. In particular, note that on the ball 
B{pa,€) or radius e centered at pA) we have A^/(A^ + r^)^ > (Ao + 64no)"^ > const •«. 
Hence, with a new constant c = c{k), 

\Fa\ < cAV(A^ + r\f on supp(/3). (6.11) 

This enormously simplifies our computations. 

The next lemma shows that we can always arrange the fiber Z to lie in some 

-'^'fe+l.Ao- 

Lemma 6.3 Given small Aq, let T : M.'k^i x ~^ -^k be the projection sending a 
concentrated connection to a "background" connection. Let [Aq] G Ai';. and assume 
that the first eigenvalues of the Laplacians d\^dAo, c^Ao(^Ao)* ^^ ^'^ P-valued 0-forms 
and SD 2-forms, respectively, are positive. Then there exist Aq > 0, z/ > 0, k > 0, C > 
0, and a neighborhood U of [Aq] in A4k+i such that T~^{U) C -A^fc+i Aq- 
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Proof: That condition (i) in the definition of -M-I'^i x can be satisfied follows 
from the proof of Lemma 7.1.24 in [DK]; that condition (ii) can be satisfied follows 
from modifying several ideas in [GP2] (Definition 4.1, Lemma 4.3d, and Propositions 

4.4). ■ 



Henceforth we will always assume that instantons [A] lie in a fixed Ai'^'". For 
such connections we have the following. 



Proposition 6.4 Fix k,u. Let tva : n\Ad P) -^ H\ := ker{{d^)*)f]keT{d^) (nat- 
urally identified with the tangent space T[a]-M.) be the L"^ -orthogonal projection. For 
all sufficiently small, positive S there exist c, ei(A), such that if A E A^^+i^Ao then 



X, 



(h,a,a) 



in 



'(|b|^+ \af + \a\ 



<ei(A)(|b 



2 I I |2 I I |2\ 

+ \a\ + \a\ ) 



where ei(A) ^ as A ^ 0, and 



X -tiaX 



<cA^(|b|A + (|a| + |a|)Ai/2^. 



(6.12) 



(6.13) 



Proof: The proof of the first statement is similar to that of [GPl, Proposition 
3.6]; we omit the details. We prove the second statement later as Proposition 10.8(b). 



Thus by taking A small enough we can ensure that tta '■ "Ha -^ H\ is injective. 
Let Oq C 1)a be an open neighborhood of zero. For X G [)a, let A^ denote the 
connection that results from acting on A by the canonical fiow of X for unit time, 
and let Oa = {A^ \ X G Oq}. Proposition 6.4 has two implications, once we take 
Oo small enough. First, Oa lies in a neighborhood of the ASD connections on which 
Taubes' contracting-mapping argument lets us "project" the image of A to an ASD 
connection. Second, by the implicit function theorem, the image of Oa in M.k+i is 
an 8-dimensional sub manifold of M.k+i- 

The quantity X — ttaX will be central to the definition of the remainder terms in 
/^ioc(p) and to the analysis in §10. We define 



e 



X 



X -tiaX 

d'^G^id^yx + {d^yc^d^x 



(6.14) 



Here G^ and G^ are the inverses of the Laplacians {d^yd"^ and d^{d^y on Vt^{Ad P) 
and VL\{AdP), respectively. 
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We make three observations here. First, if A is ASD, ||((i:^)*G'^(i^X||2/||X||2 is 
small for any rotation vector field, not just SD ones. This is to be expected since any 
rotation vector field is an approximate isometry and hence should approximately pre- 
serve anti-self-duality. However ||(i^G^((i^)*X||2/||X||2 is small only for the rotation 
vector fields of duality opposite to that of the connection. Second, to deduce from 
this smallness that the parameter space injects (locally) into B, one must know that 
the first eigenvalue of the Laplacians on zero-forms does not tend to as A tends to 
zero, as it will if the "background" connection is flat (or merely reducible). Indeed 
on Aii{S'^), all rotation vector fields (not cut off), lifted as above, preserve the stan- 
dard instanton. (On Aii^Rf^), if one writes instantons in the usual gauge and instead 
lifts rotations using the flat connection, then ASD rotations preserve the standard 
ASD instantons centered at the origin, while SD vector flelds induce the effect of a 
gauge transformation.) Third, because of the cutoff P, f)^ is not a Lie subalgebra of 
r(TiV), although in some sense it is close to being one. Thus, while intuitively i)A is 
associated with the Lie algebra of an 8-dimensional group of translations, dilations, 
and rotations, Oa is not quite the orbit of an 8-dimensional local Lie group, hence 
the term "almost-action" . 

We will return to this point at the end of this section, but flrst we wish to relate 
Oa to the gluing construction in [DK]. The flbration of a region in A^'^^^i x '^^^^ -^fc 
is usually viewed in terms of center point, scale, and gluing parameter. We claim 
that on an inflnitesimal level these are essentially the eight parameters used to deflne 
the approximate tangent space. Indeed [GP2, §5] it was shown that lifts using the 
translation and dilation vector flelds do correspond to inflnitesimal changes in center 
point and scale, up to an error that is essentially 0(A). ([GP2] dealt only with TWi, 
but under a suitable deflnition of "concentrated" the same argument works more 
generally). It remains to identify our action of S0{3) (the SD rotations) with the 
"gluing parameters" of the construction in [DK, §7.2]. As both constructions are 
non-canonical we content ourselves with a somewhat heuristic correspondence. 

Instantons in the subspace J^l'^i x have a single "charge-one bubble" and are 
otherwise not concentrated. For any such ASD reference connection Aq = A there 
exists a gauge over ball B{pa, K\) such that after pulling back to 5(0, K\) C R*^ 
by a positively-oriented normal coordinate system {x*}, the connection form is close 
to A^x'^, the standard instanton on R*^ of scale A and center the origin. (See [DK], 
§8.2.1.) Here i^ > 1 is any flxed number and "close" means that after dilating by A, 
the two connections are C^-close on -8(0, K) C R^; the undilated connections satisfy 
|(V-'(A — A^*'^)! < ciA^^"-', < j < 2, where by taking Aq small enough we can take ei 
as small as we please. After a choice of normal coordinate system, the identiflcation 
R^ = H, and an identiflcation of SU{2) with the unit quaternions, the connection 
form for Af'^ on our ball is 

lm{xdx) 
^0 = \2 I I 12 - ^-^^ 
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For integers j = 1, . . . , 10 define the sets Uj = B(pA,jno\^^'^) and Vj = N — Uj. 
Also let let f/oo denote the annulus f/io f] Vi and let Q denote the smaller annulus 
U9 n ^2- We choose gauges sq, Sqo (local sections of P) over Uio, Uoo respectively such 
that the transition function between Uiq and Uoo is gooo{x) := x/\x\ (i.e. Soo = sogooo)', 
furthermore we take sq to be the radial gauge for A with respect to pa in which (6.15) 
is written. The connection form for A^^'^ with respect to Sqo on Uoo is then 

XHm.{xdx) 

To make contact with the construction in [DK], we will pretend that on the ball 
Us, our connection A is exactly standard (so that the connection form relative to sq 
on f/g is (6.15)). Let /3 be a function that is identically 1 on A^ — Uoo and identically 
on fi, with |V/3| < cng ^A~^/^. (Note that the "interior" part of the support of /? 
occurs where P = 1.) On Uoo let u'^ denote the connection form (3uj'. We then define 
a new connection A' on P by declaring the connection form for A' in the gauge Soo 
over Uoo to be uj'^, and declaring A' = A on N — Uoo- We think of A'\y^ as a cut-off 
"background connection". In fact, we can define a bundle Pk of Pontryagin index k 
by replacing the transition function gooo by the identity; A'\vg extends to a connection 
on Pk that is fiat near p^- Note that over Q the connection A' is fiat; its connection 
form there, relative to Sq, is gooodg^^ = lm.(xdx)/\x\'^. 

Our choice of normal coordinates and identification R"' = H induces a Lie algebra 
isomorphism 6 : A^T* M — > su(2) = Im(H), mapping the standard basis of A^Tq*R^ 
to {i, j, k}. (Alternatively, 6~^ is given by mapping v G su(2) first to the vector field 
induced by quaternionic left-multiplication on H, then to the two-form obtained by 
lowering an index using the metric.) Let v = 6{a) and assume \v\ is not too large. 
We consider the canonical fiow of X = (SaijX'^-^ acting on the cut-off connection A' . 
After integrating the flow for time 1, the action on the base is a; ^^ 'P{x) = hi{r)x, 
where r = |a;| and where hi{r) = exp{j3v). Let A'^ be the connection determined by 
this integrated canonical flow. 

An alternative flow, the "so-flat" flow, is obtained by lifting X to P\us using the 
flat connection determined by Sq, and extending this flow to the complement of f/g 
by the identity (since X is supported in f/g). If we integrate the Sg-flat flow for time 
1, the resulting connection form on Q with respect to sq is 



, lm(hi(r)xd(hi(r)x)) , _, ,, _, , „, /^ -, ^,^ 

t^ V = n^ = gooohi a/iifi-ooo + gooodgooo- (6-17) 

By our earlier comments, the connections resulting from the canonical flow and the 
So-flat flow are gauge equivalent (and in fact are equal outside f/g). Thus A'^ is gauge- 
equivalent to a connection A" equaling A' on N — Uoo, and whose connection form in 
fl (in the gauge sq) is (6.17). 
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We claim that the connection A" is the one constructed in [DK, p. 296]. The 
latter essentially begins with the connection A' (thought of as a cut-off connection 
on Pk\vi glued to a connection on a /c = 1-bundle Pi\uio) ^^^ modifies it on Uoo as 
follows. Let hi{r) be as above, let h2{r) = exp(— (1 — f3)v) and consider the two 
gauge transformations /ii,/i2 over Uoo given by hi{soo{x)) = Soo{x)hi{r). The gauge 
transformation hi does not extend to all of P (unless exp(f) = ±1), but it does 
extend to the bundle Pk defined earlier, and for r < 4noA^/^ changes the trivialization 
Soo (extended to Pk) by the constant exp(t;). Similarly the gauge transformation /i2 
extends to P\uuv changing the trivialization s^o for r > 8noA^/^ by exp(— f). Because 
hi^dhi = h2^dh2, the two gauge transformations have the same effect on the flat 
connection A'\q. Therefore we can define a new connection A^^ by 

,DK _ f hiA') on f/g 



The connection form for A^^ with respect to Sqo on Q is h^^dhi = /i^^(i/i2, so 
with respect to sq the connection form is precisely (6.17). Thus A" and A^^ coincide 
on Q. Since /ii = 1 on Vg we have A^^ = A' on this region, and since X = on Vg 
we have A" = A' there as well. Thus A" = A^^ on V2. It remains to consider only 
1/2- On this ball, a computation shows that the connection form for A' relative to sq 

is _ / 9 

, lm.{xdx) f ~ A 



On U2, we have /i2 = 1, so the connection form for A^^ remains uj'q. But the connec- 
tion A' is also preserved by the "sg-flat" flow of X; replacing x by hi{r)x in (6.19) 
does not change ujq, since hi is constant on f/2. Therefore A" = A^^ over all of A^. 

Now let Ay be the connection obtained from applying the canonical flow of X for 
time 1 to A (rather than to A'). The preceding shows that up to gauge equivalence, 
when \v\ is not too large the only differences between A^ and A^^ arise from the 
facts that (i) A is only approximately standard on a small ball B{pA,KX) rather 
than exactly standard on the larger ball B(pA, lOnoA^^^), and (ii) we do not cut off 
Ay before applying the flow. 

It should also be noted that since the subspace [)™* corresponding to the SD 
rotation vector flelds is not closed under Lie bracket, if we let f)™* act on A by the 
canonical flow for time 1, we should not expect to get a closed "orbit". But the 
construction in [DK] shows that the space of gluing parameters is a copy of 5*0(3). 

To address this discrepancy, flrst note that if A^ were R^ we could dispense with 
the cutoffs in the deflnition of i)ji. The vector flelds would be globally deflned, and 
would generate a Lie algebra exponentiating to the group of motions of R^, 

{x^ax + b\{a,b) eH* xU}. (6.20) 
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The stabilizer of the origin would be H* x {0}, and if the initial connection A were 
standard, the set of connections generated by letting SU{2) C H* act via the canonical 
flow would give two copies of the space obtained by the construction in [DK], as (a, 0) 
and (—a, 0) yield the same connection. (Alternatively, if f 7^ is small enough, the 
connections A' = A^^ and A" = A^^ are gauge-inequivalent, because the gauge 
transformation hi defined earlier — which always extends to -P|Ar_{p^} — extends to P 
if and only if exp(f) = ±1.) From our earlier discussion the action of (a, 6) on the 
standard instanton is given by pulling back the connection form by the inverse of 
{a,b), which results in a connection of scale \a\ and center b (cf. (8.23)). The unit 
quaternion a/\a\ corresponds to gluing angle, doubly parametrized. 

Intuitively, then, we have the following picture. Fix a reference connection A = 
Aq G -Ml+i A • Let B C su(2) be the ball centered at the origin that is carried 
diffeomorphically to SU{2) — { — 1} by the exponential map, and let B' C 1)^4* be the 
corresponding set of SD rotation vector fields. If we let the canonical flow of elements 
in B' act for time 1 on [Aq] we obtain a space that (for purposes of integrating 
reasonably behaved differential forms) approximates two copies of the fiber .^dk- 
This correspondence becomes sharper as Aq — > 0: as we take the limit and rescale the 
(local) metric and normal coordinates correspondingly, the failure of ()^ to close under 
Lie bracket disappears on any ball of fixed rescaled size. Furthermore, because the 
rescaled metric becomes fiat, the limiting space of vector fields i)A is the same whether 
we center the rotations and dilations at pa or at p. Thus the limiting action of H* x H 
above appears to generate an immersed manifold that we can treat "homologically" 
as two copies of ^dk- This discussion motivates the assumptions we make on Z in 
the next section. 



7 The fiber Z. 

For our purposes we need only consider one fiber Z = Zx^ of the projection A1'fc_|_i Xo ~^ 
Ai'f.', we do not need to construct the whole fibration. We will assume Z has the 
following five properties. The first three are known to be satisfied by .Z'dk, so the key 
assumptions are really the last two, which require the tangent spaces of Z and Z^k to 
be close in various norms. The assumptions can almost certainly be weakened from 
those below, at the cost of considerably more technical work. 

(Zl) Z fibers over A^ via the projection Z -^ N sending a concentrated connection 
to its center. Given f/ C A^ we let Z|^ denote the inverse image of U under the 
projection. We assume that A^ can be covered by a finite number of normal coordinate 
charts Ui (which we may take to be geodesic balls) such that for each i there is a 
two-to-one fiber-preserving covering map Pj : H^ x Ui -^ Z\u. having additional 
properties listed below. Here H^^ = {a G H* | \a\ < Aq} = (0, Aq) x SU{2), where 
the isomorphism is a ^^ (|a|,a/|a|). (Note that the center-point and scale maps are 
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defined globally on Z; it is only for the purpose of handling gluing parameters that 
we need to chop up A^.) 

In general a normal coordinate system {x^} on f/j determines an identification 
between Ui and a ball in H centered at the origin, and hence a local action of H^^ x H 
on Ui given by ((a, b),x) \-^ ax + b. We assume that on each Ui there is a positively 
oriented normal coordinate system {x^} on Ui such that pj is approximately given by 
the induced canonical flow of this H^^ x H-action, based at the standard instanton on 
R^, in the sense that (Z2)-(Z5) below are true. From [D, §3], the orientation induced 
on the fiber Z by the standard orientation of H x H as a complex vector space is then 
compatible with the standard orientations of A^^+i and Aik (i-e. the orientation of 
-^fc+i A i^ ^^^ product of the orientation of Z and the pullback of the orientation of 
M-k)- These are the orientations used in (1.8). 

(Z2) We assume that for each i, the scale and center point oi A = 'p^{a,b) are 
X{A) = \a\ and Pa = b (in quaternionic normal coordinates) respectively. 

(Z3) Given i, let [Aa.t] = Pi{ci,b) and let Fa^b = Fa^^ ■ We assume that for any 
-ft' > 0, on the ball B{pa, KX{A)) we have 



^ I V48|a| 

a,b\ 



<ei(A)A-^ (7.1) 



(|a|2 + |a;-6|2)2 

where ei(A) ^ as A ^ 0. 

(Z4) Let B be the component of exp^^(S'f/(2) — {—1}) containing 0. A tangent 
vector V G T(^a,b){iiX) ^ -^) &^^^ ^i^^ ^o a vector field on a neighborhood oi b E B that 
determines an element X„ G t)A- Writing X„ = lx^Fa, we require that 

\\-p.v + X4L^<e2{\)\v\ (7.2) 

where 62 (A) ^ as A ^ 0. Observe that because of (6.12), we can alternatively write 
(7.2) as \\-p,v - {-^aX,)\\l2 < e^i^XM; cf. (6.5). 

(Z5) Letting ^(, = fj^v + Xy and ^^ = X^ — tc^Xv, we further require that ^^ satisfy 
the same weighted L^ bounds as C^^ given in Proposition 10.8 ((10.74) and (10.75)), 
and the pointwise bound (10.72). 

If not for (Z4) and (Z5), we would not need to assume (Z1)-(Z3). By itself, (Zl) 
follows from the description of the ends of moduli space in [DK, §§7.2 and 8.2]; we 
simply take the local diffeomorphism (0, Aq) x 5*0(3) x f/j = Z\u., and pre-compose 
with the covering map SU{2) -^ S0{3). Similarly, (Z2) and (Z3) follow from [DK, 
§8.2.1]. 

What is not clear is whether the construction in [DK] yields a fiber whose tangent 
space at [A] is sufficiently close to TiAiTi-A) in the norms required for our analysis. If the 
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subspace i)A C r{TN) were a Lie subalgebra, then by (6.5) the canonical flow would 
generate a fiber whose tangent space at [A] would be precisely TiAi'HA)- However, 
i)A is not closed under Lie bracket, and the canonical fiow of vector fields in i)Ao, 
acting on a single reference connection [Aq], has no chance of generating an orbit 
that reasonably approximates all of Z^dk]', the cutoff in the translation vector fields 
prevents the canonical fiow from moving the center point very far from paq , whereas 
all points in A^ can occur as center in Z^^x]- But the estimates relevant to proving 
Theorem 1.2 are much less sensitive to changing the definition of of translations than 
to changing the definition of rotations and dilations, so it seems plausible that, by 
a patching argument, altering the definitions of only the translation vector fields in 
any significant way, we can splice together canonical fiows based at connections with 
nearby center points. Presumably by splicing enough fiows together we can obtain 
a fiber that is C^-close globally to Z^dx] and C^-close locally to the orbit of some 
canonical fiow. Even if the splicing construction fails, there are two reasons why, for 
purposes of integration, we may not need to define a true fiber (such as Z[dx]) in a 
topological sense. First, when we integrate fi{p) A fi{q) over an orbit of the canonical 
fiow, only connections with center point near p and q contribute significantly to the 
integral. It is likely that the same holds for an integral over Z^jyx], so that it suffices 
to approximate only a region of Z^dk] consisting of connections with center point in 
a fixed small ball. Second, although the canonical fiow of the subspace 1)™* acting 
on [Aq] does not generate a closed manifold, it does generate an immersed copy of 
SU{2) — { — 1} lying in a small neighborhood of an S'0(3)-orbit in Zyj:)^]-, and which 
geometrically wraps twice around this orbit. A careful analysis may show that there 
is a homotopy from the immersed punctured SU{2) to a punctured double cover of 
the 5*0(3) in Z^dk], small enough in all relevant norms that there is only a negligible 
difference between integrating over Z^^k] and over the orbit of the canonical fiow. 

Thus the idea behind (Z1-Z5) is basically that there is fiber that interpolates 
between Z^dk] and the not-quite-fiber generated by splicing together canonical fiows. 
The hypotheses (Z4-Z5) amount to assuming that, in this interpolated fiber, the 
bounds on ^' are as good as they would be if the tangent space to the fiber were the 
one determined by the canonical fiow. We need such an assumption because, when 
we pull fid{p) /\fid{(l) back to Z, we need to insert true tangent vectors to Z into (5.3); 
the TT^X's in the expansion (8.1) below should be replaced by p^,f 's — which has the 
effect of replacing each C, in (8.6) with ^'. 

There is other evidence making the simultaneous satisfaction of at least (Z1-Z4) 
very plausible. On R^, if we remove the cutoffs in the definition of i)A and define Z 
from the canonical fiow acting on the standard instanton, then the spaces T[a]Z and 
T-Ca coincide. In the case of 1-instantons over simply connected definite manifolds 
(where the background connection is fiat and there are no gluing parameters, so Z is 
five-dimensional), (Z4) was shown in [GP2] to be true with e2(A) < c\^~^ for small 
(5 > 0; in [G2] this was strengthened to A^^*^. 
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The technical hypothesis (Z5) is more ad hoc, and stronger than necessary, but 
is not without basis. In the setting of the five-dimensional moduli spaces men- 
tioned above, certain estimates of this paper and [G2] can be combined to show that 
Ikp'^^'IU < A-^+^(6 ■ A + a ■ A^/^) and WtaCU < b ■ \^{b ■ A + a ■ A^/^), much stronger 
than the L^ bounds assumed in (Z5). (Here Vp denotes distance to an arbitrary point 
pe N.) 

An important implication of (Z4) and (6.12) is the following. Let dvol^ be the 
volume form on Z induced by the L^ metric on B. (Hypothesis (Zl) determines 
an orientation on Z, so there is no sign ambiguity here.) Let a e R^ denote the 
quaternionic variable in H^^. Then 

p*{dvolz ) ~ const ■ d'^a A dvol^ = const ■ \^d\ A dvols-i A dvol^- (7.3) 

where the approximation becomes exact as A — ^ (and the constant is of course 
nonzero). Our chief use of (7.3) will be to help estimate the integrals of the non-local 
terms in jidip) ^ f^dil)- For this purpose, we don't actually need "~" in (7.3); "< 
would suffice. Thus hypothesis (Z4) can be weakened. 



8 Localizing i^dip) A i^diq)- 

From now on we assume there is a fiber Z with properties (Z1-Z5). To motivate 
the leading-term calculation in (1.10), suppose for the moment that for [A] G Z the 
tangent space T[a]Z is precisely, rather than approximately, the space TiAiTi-A) C 
T[^]A1. Then if we pull fid{p) back to Z, we need only apply (5.3) to arguments of 
the form tta^a with X G f)^. Recalling the definition of ^x in (6.14), we then have 

G^{n^X, n^Y} = G^ ({X, Y} + i?emi(X, Y; A)) , (8.1) 

where 

Rem,{X,Y) = {X,^y} - {Y,^x} + {^x,^y}. (8.2) 

(We omit writing most of the A-dependence in these formulas explicitly.) Here {■, ■} is 
a universal, local, antisymmetric bilinear pairing that takes two Ad P-valued 1-forms 
and produces an Ad P-valued zero-form. Note that Rerrii {X, Y) is antisymmetric in 
X and Y. 

In [Gl, Proposition 2.1], it was shown how to expand several of the expressions 
appearing in (8.1-8.2) as a leading-order local term plus a non-local remainder, smaller 
in appropriate norms. In particular, for any vector fields X, Y on A^, we have 

Gt{X, Y} = -^-F{X, Y) + G^(P"(X, F)), (8.3) 
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where F = Fa and where 

2R"{X,Y) = n{F){X,Y) + F{AX,Y) + F{X,AY) 

- 2(VfF)(V,X,F) - 2(VfF)(X, V,F) - 2F(V,X, V,F). (8.4) 

Here TZ is an endomorphism proportional to the Riemann tensor whose precise form 
does not concern us. As a consequence of (8.3), R"{X, Y) is antisymmetric in X and 
Y. The precise way in which the derivatives of F and the derivatives of X and Y are 
hooked together in (8.4) is critical for certain estimates (Lemma 10.4). 

Applying (8.3) to the first term in (8.1), we find 

G^{7rX, nY} = -^ (F(X, Y) - Rem^iX, Y)) , (8.5) 

where 

^Rem2{X,Y) = G^{R"{X,Y) + Remi{X,Y)) 

= Gt (i?"(X, Y) + {X, ^y} - {r, ix} + {ix. iv}) 

:= G^{Rem'^{X,Y)). (8.6) 

Inserting all this into (5.3) we find 

/i,(j9)(7rX,7rf,7r\/,7riy) = 
= ^tr ( (F(X, Y)F{V, W) + F(X, V)F{W, Y) + F(X, W)F{Y, V))\^ 

+ Rem^{X, F, V, W\ + Rem,{X, F, V, W\ , (8.7) 

where 

Rem^ = const ■ tr(F A Rem2), Reiriii = const ■ ti{Rem2 A Rem2). (8-8) 

(In (8.8) we regard F and Rem2 as r(A(i P)-valued two-forms on the space of vector 
fields.) The first term in (8.7) is just {STr^yhviF A F)(X, Y, V, W)\p, which, since F 

is ASD, can be rewritten as {8'K^)^^\F\'^dvol{X, Y, V, W)\p. Thus if we define 

fi,,,{p){nX,nY,nV,7iW) = ^\F{p)\^dvol{X,Y,V,W)\p, (8.9) 

then (8.7) simplifies to 

fidip)inX,nY,nV,TvW) = i^iocip)inX,nY,nV,nW) 

+Rem3{X, Y, V, W) \p + Rem^iX, Y, V, W) \p. 

(8.10) 
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Of greatest concern to us will be the local part /iioc(p) of this expression. Note 
that /iioc(p) A /iioc(p) = 0, since dvolp A dvolp = 0. However, we will see that 
limg_»p /^/iioc(p) A fiiocig) 7^ 0. In this integral it turns out that instantons of 
scale ^ dist(p, g) give the main contribution to the integral. Thus the pullback of 
fidip) A fj,d{p) to Z can be thought of loosely as a (5- form concentrated on instantons 
of scale zero. 

To integrate fid{p) A fidW) "we must still worry about the non-local remainder 
terms Rerrii, as well as the fact that the tangent space T^j^Z is not precisely tt/Ha- 
We will see later that, as Aq ^ 0, the contributions to the integral of /^d(p) A /Xd(g) 
over Z = Z\^ from both of these corrections tend to zero. What we wish to compute 
now is 

lim / /iioc(p) A/iioc(g), (8.11) 

where p and Aq are fixed. 

For given p, g, as we integrate fiiodp) A/iioc(g) over Z, the center point pa of [A] in 
Z moves around, affecting the support of the vector fields X, Y, V, W in (8.9). Thus for 
/^ioc(p)A/iioc(Q')(7rXi, . . . , nXs) to be nonzero, pa must lie in B{p, 8noA^/^) f] B{q, 8noA^/^). 
In particular we can restrict pa to a small normal-coordinate ball U centered at p 
(which we can take to be one of the f/j in (Zl)) without affecting f^ /iioc(p) A fJ,ioc{q)- 
Since we are interested in the limit as q ^ p, we may also assume q & U. 

Let 2L = dist(p, q); we will later send L to zero. Define Zi G Z to be the set of 
instantons in Z obeying the two criteria 

PAeBip,no\'/^). (8.13) 

Note that if [A] E Z\ thenp^ ^ -B(g, (no + l)A"'^/^), so that the cutoff /9 in the definition 
of the vector fields Xj equals 1 at both "p and q. We will see later that the contribution 
to (8.11) from the complement of Z^ is negligible. 

Let {a^oi^j} denote normal coordinates on \J . We change coordinates by setting 
Xnew = -^"^a^oid and replace the metric g^xA on U by (^new = -^~^5'oid- Because of 
the conformal invariance of \F^dvol, /iioc(p) A /iioc(g) is unaffected by this change. 
However, since A = Aqm represented a distance in the old coordinate system, we 
now have a rescaled upper cutoff for Anew = -Z^~^Aoid on Z, namely Ao,ncw = Ao/Iv. 
Measuring all distances in the new metric, the defining conditions for Z\ become 

i^"°-' > Anew > L, (8.14) 

PA^B[p,NL-^l^\]ll). (8.15) 

As L — i> 0, several things happen. For A E Z, \Fa\ becomes approximately standard 
on any fixed ball B{p,K); g^cw approaches the flat metric X]('^^new)^j and (in the 
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rescaled metric and coordinates), Z\ becomes an S'0(3)-bundle over monotonically 
increasing regions of center-scale space that exhaust (0, oo) x (R^ — {0}) as L ^ 0. 
Because of (Zl), we can identify Z\ with ever-increasing subsets G^ of G := (H* x 
H)/Z2. Letting fi[^^ denote the pullback of /iioc to H* x H, we therefore have 

lim / /iioc(p) A /iioc(g) = - lim / fi^^ip) A fi^M^ (8-16) 

provided this integral converges. 

Let p be as in (Z1-Z5). Write elements of G as pairs {a,b), and write A(a,b) = 
p{a, b), F^a,b) = -^A(„,6) as in (Z3). If we define fi[^^ = p>ioc e ^"^{G), then 

f d d 

/^ioc(p) A/iL(9) = / /^ioc(p) Apioc(9)(^, • • • , -g^)da^^- ■ ■Ada^Adb^A...Adb'^. 

(8.17) 

To compute this we need to know 'p^:,^^^^^/^a\'f>^,^^'.^/^b^. At each (a, 6) define 
Xj, Yi to be the vector fields on R^ induced by d/dd\ d/dU respectively. Temporarily 
writing V = a*^^ and Yi = Xj+4, from (Z4) there is an 8x8 matrix C = Id + 
0(e2(Aoid))) for which we have 

P*(a,b)C^id/da^ = -vrA(„_,/x,i^(a,b)- (8.18) 

Hence from (8.10), if not for the correction matrix G we would have 
d d ^ ^ 

/^Joc(p)(^> ■ ■ ■ ' 'd^^ ^ /^loc(p)(7rA(„.,)Xi^, ■ ■ ■ , T^A^^^.^X^) 



iTX 



2\-2| 77 /„m2 



FA{p)Vdvol^{Xi,...,X^), (8.19) 



with a similar formula if we replace any of the d/da^^s by a d/dU . 

Let us ignore, for now, (i) the 0(e2(Aoid)) = 0(e2(LAncw)) difference between the 
matrix G and the identity, and (ii) the 0(|a;oidP) = 0{L'^\x^e-^\'^) difference between 
the true metric on the rescaled ball and the flat metric; we will make the corrections 
later. Since the Euclidean volume form is dvol = dx^ A ■ ■ ■ A dx'^, we will write 
dvolp = d'^Xp, dvolq = d'^Xq below. Hence 

f^Up) A f^'ioMig^, • • • ' 96^) = {8n')-'\FA{p)\'\FA{q)\'d% A d%{X,, . . . , Y,). 

(8.20) 

So far we have treated d'^XpAd^Xq as an 8-form whose arguments are vector fields, 
but we may as well consider it as an 8-form on the 8-dimensional space TpM © TgN. 
Using the canonical isomorphisms TpIV^ = T^R^ = R^ and our further identification 
of R^ with H, we can write each Xj, Yj in the form {v, w) G H © H. In the coordi- 
nate system {a;^^^}, the origin represents p, and we may assume that q hes on the 
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real axis with coordinate 2 G H. Let ri = 1 G H and let {tj}! be the quaternions 
i,j,k. Then Xj(x) = Tia^^{x — b) and Yi{x) = Ti. so the corresponding elements in 

TpR^©T,R4 = H©H are X,' = {-Tia-^b,Tia-\2-b)) and F/ = {Ti,Ti). Modulo the 
span of the F/, we have X'^ = (0, 2ria~^) := Xf , so rf^XpArf^a;g(X;, . . . , X^, F/, . . . Y^) = 
d^Xp A d^Xq{X'l, ..., X'l, 17, . . . F4'). Since d'^Xp{Xl', *, *, *) = 0, it follows that d'^Xp A 
d%{Xl . . . , X^', F/, . . . Fi) = d%{Yl, ..., Yi) d%{Xl ...,X'l). But d%{Yi, ..., Y^) 
1 and d'^Xg{Xi, . . . , X4) = 2^|a|~*^. Hence 



/^L(p)/\/^ioc(9)l(a,fe)( 



9^ 



A- 



^7r2)-2|F(,,5) 



|i^(a,6)(2)|^24|a 



f8.211 



where -F(a,b) is the curvature of the instanton obtained from the action of (a, b) on a 
reference connection in our fiber. Therefore 



^loc(p) A /^loc(9)l(a,fe) = 2^(87r2) 2| 



\a,b) 



F(„,6)(2)|>|-VaArf46. (8.22) 



Because Aq is small, there is a reference connection in our fiber that looks approx- 
imately standard on a ball of any fixed large radius, with the approximation getting 
better as Aq ^ (the rescaling by L only improves this approximation). Our next 
approximation is to ignore the difference between the true reference connection Aq 
and the standard instanton; we will deal with the error later. The connections in the 
limiting Zi are then the orbit of the standard instanton Ai under the action of G. 
Hence 



\F(a,b)\'^{x) 



K\)^aA'{^) 




rf(a-i(a;-6)) Ad{a-'^{x- 


-b)) 


(l + |a-i(a;-6)2)2 





48|a|4 



(|a|2+|a;-6|2)4- 



Thus 



I J, := lim 



KoM ^ f^'ioM = 367r 



-4 



2^ \a\* d*aAd% 



U.ni\a\'+Wn\a\' + \2-b\r^ 
and provided the error terms we have so far ignored are truly ignorable, 

1 , 



L^O 



lim / /iioc(p) A/iioc(g) 



(see 8.16). 



(8.23) 



(8.24) 



(8.25) 



Lemma 8.1 /„ 



1. 
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Proof: First introduce spherical coordinates in a-space (with radial variable we 
call A) and cylindrical coordinates in 6-space (with radial variable r). The integrals 
over the 3-sphere in a-space and the 2-sphere in the imaginary subspace of 6-space 
are trivial, contributing factors 27r^ and Att respectively. Thus 

^oo ^oo ^oo A'^r^ 

A=0 Jr=0 A=-oo (A2 + r2 + ^2)4(p + ^2 + (^ - 2)2)4 

(8.26) 
Introducing polar coordinates in the X-r quarter-plane, the angular integration re- 
duces us to an integral over two real variables. Using the Residue Theorem to inte- 
grate over z leaves us with a one-dimensional integral that can be computed in closed 
form, yielding Ip = 1. | 



In the local calculation we ignored errors from four sources: (i) the contribution 
from the complement of Zi, (ii) the difference between the flat metric and the true 
metric on the rescaled ball; (iii) the difference between p^v and — TTy^X^ (i.e. the 
difference between the matrix C and the identity); and (iv) the difference between 
\F\a,b and the standard instanton of scale \a\ and center b. 

Let us first deal with (i). Since the vector fields Xi we feed into /iioc are cut off 
at distances > 2noA^/^ from pA, the integrand piodp) A /iioc(q')(^i, • • • ,^8) vanishes 
for Pa outside the ball B{p,2noX^^'^). For purposes of integration we therefore need 
only that portion of Z lying over a ball of fixed small radius centered at p. Because of 
(6.11), the integrand over such a region is bounded by a constant times the integrand 
we used in our previous calculation, cut off in certain regions. Since the integrand 
in (8.24) is integrable over all of H* x H, given any exhaustion Wi C W2 C ... of 
H X H, the integral over the complement of Wn goes to zero as n — * 00. As the sets 
Gl provide such an exhaustion, the integral of p\oc{p) A /iioc(9) over the complement 
of Zi tends to zero. 

Next we turn to the errors (ii)-(iv) listed above. In place of the set Zi consid- 
ered in the derivation above, for i^ > consider the sets Zk,l defined by {L^'^ > 
A^/^ > L, Pa ^ B{p, K\)}. After rescaling by L as before these conditions become 
{Anew > L, Pa & -B(j9, i^Ajicw)}- This time as L — i> 0, Zk^l exhausts Zk^ := {pa G 
B{p,K\icw)}, with Anew unrestricted. But convergence of the integral Ip implies that 
given €3 > we can choose K large enough and L small enough that the integral of 
the integrand in (8.24) over the complement of Zk,l is less than e^. On the interior set 
Zk,l, hypotheses (Z3-Z4) imply that given €4 > 0, by taking L sufficiently small we 
can arrange for the ratio of the true p'lodp) A/iioc('?) to be within a multiple (1 + 64)^^ 
of the integrand in (8.24) over all of Zk^. (Error (ii) gives an O(Aoid) < 0(L°-^)) 
contribution to £4; error (iii) a contribution e2(Aoid) < e2(-^°'^), through the matrix 
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C. As for error (iv), in the rescaled metric and coordinates (Z3) implies 



IF ^1 

r a,b\gn 



(|a|2 + |x-fe|2)2 



< ei(Aoid)A-t, (8.27) 



so that this error gives a contribution ei(Aoid) < ei(-^°'^) to 64.) Hence we can arrange 
for the integral of f^ f^'iodp) ^ Moc(?) over lie within 63/2 of the integral over Zk^^ 
of the integrand in (8.24). It follows that by choosing L small enough, the errors 
introduced by our approximations can be made arbitrarily small. 

We have now proven the following. 
Proposition 8.2 For any p E N , 

lim / /iioc(p) A/iioc(g) = -. (8.28) 



9 The nonlocal terms in jJ^dip) A fidiq)- 

From (8.8-8.10), fJ.d{p) /\ l^dil) can be expanded as /iioc(p) A/iioc(q') plus a remainder. 
Our next task is to show that, as Aq — * 0, the contribution of this remainder to 
Jz f^d.{p) A /id(q') tends to zero. This will follow from the next proposition, whose 
proof occupies the remainder of this paper. 

Proposition 9.1 Let Q be the restriction to Z\^ of Qm := jidip) A /id(g) — /iioc(p) A 
f^iocW) G ^^{Ai). Assuming (^Zl-Z5^, there exists S > such that 



n < const A^, (9.1) 

where the constant is independent of p and q. 

Observe that Propositions 8.2 and 9.1 together prove Theorem 1.2. 

Proving Proposition 9.1 requires some bounds on Rem2{X^Y) for X, F G \)a- 
Before starting to derive these, we need some notational simplification. Below we 
will be computing many things that are multilinear in data of the form (b, a, a) G 
Tp^N © R © A^Tp^N. Given a single vector field X constructed from such data, we 
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can denote the defining data of (6.6) by (bx, ax, o:x)- Tliis notation becomes cumber- 
some, especially wlien computing objects tliat involve more than a single vector field. 
However, because |X(b,a,a)| < c(|b| + (|a| + |Q;|)A~^ryi), the a and a data always enter 
our bounds with precisely the same weight, so for shorthand we will generally lump the 
a and a terms together, and simply call them a. Furthermore, for simplicity we will 
often omit the subscripts X,Y, . . . in the defining data (bx, ax, ctx), (by, ay, ay) . . .; 
the dependence on X,Y, . . . can be reconstructed from context. E.g. if we write 

[something bilinear in X, F G 1)a| < Ci6^ + C2&a + c^a^, (9.2) 

then on the RHS the notation has the following meaning: 

6^ = |bx||by|, 

ba = (|bx|(|ay| + |ay|) + |by|(|ax| + |ax|)) , 

a^ = (|ax| + |«x|)(|ay| + |ay|). (9.3) 

If the bilinear quantity is antisymmetric in X, Y (as in Theorem 9.2 below), then the 
estimate factors through the wedge product, in which case we can take 

6^ = |bxAby|, 

ba = (|bx|(|ay| + |ay|) + |by|(|ax| + |«x|)), 

a^ = (|ax||ay| + |ttx||ay| + |ttx A Q;y|) . (9.4) 

Finally, the notation x < y means x < cy for a constant c that is uniform in all 
relevant parameters. 

With this notation in mind, we have 

Proposition 9.2 (a) There exists S > such that 

WRem^iX, Y)\\^ < \-'+\b^ + ba ■ X'^'^ + a' ■ \-^l^). (9.5) 

Furthermore, there exists Ci > such that, for r^ > CiX^^'^ , we have the pointwise 
decay 

\Rem2iX,Y)\ < r^' {b^ + ba ■ X''^^ + a^ ■ \~'/^) . (9.6) 

(b) Let v,Xy,C,v,Cv ^^ '^^ ^'^ hypothesis (ZB) of %7. If we alter the definition of 
Rem2{Xv,Xyj) by replacing ^^ with ^'^, then the bounds above still apply. 

We will prove Proposition 9.2 (actually a slightly stronger version) in §10. Let us 
assume it for now and move onto its application, the proof of Proposition 9.1. The 
decay estimate (9.6) is crucial in this proof; the global bound (9.5) does not suffice. 
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Proof of Proposition 9.1. By hypothesis (Zl), /^^ < J2iP*i^- Since Q G 
n^{Z) we can write fi = / dvol^ , where the function / can be computed at [A] E Z 
from any (positively) oriented L^-orthonormal basis 771, . . . , 773 of T[a]Z by 

f{[A]) = n{r^,,...,r^s). (9.7) 

Similarly, we define /'([A]) = Q^iVi^ ■ ■ ■ ^Vs)' where the {rjl} are an orthonormal 
basis for ti/Ha, and set Q' = f dvol^ € Q^{Z). 

We will first show that f^^' < cAq (where 6 is as in Proposition 9.2), and then 
deduce that the same is true for f^ Q. 

We proceed to estimate Q'. By Proposition 6.4, an approximately orthonormal 
basis of tta'Ha, up to a scale factor (Stt^)^/^, is given by {1]'^ = 7r^X(b„,a„,Q„) := 7rX„}^ 
as (b„, a„, an) run over an orthonormal basis of Tp^N © R © A'^(Tp^N). Applying 
Gram-Schmidt to {7rX„}, it follows that /' < const • i7(7rXi, . . . , TTXs)dvol^ . Hence 
from (7.3), 

p*{n') < c f](7rXi, . . . , TiXs)\^d\ A dvols3 A dvolN- (9.8) 

Symbolically we can write il' = X]j=o ^'i ^^ ^ ^^'^ ^^ terms of the form F* A Rem^^ ^ 
< i < 3. We estimate the integrals of VL'^ one case at a time. Only the the completely 
nonlocal term f2g requires the pointwise decay estimate (9.6); for the remaining terms 
the uniform bound (9.5) suffices. Bounding J VL'^ requires some care but we shall see 
that the integrals of Vt'^ and VL'2 can be estimated heavy-handedly. 

Case 1: Terms of the form Rem\. 

Let Z2 d Z denote the subset of connections for which both dist(p, pa) and 
dist(g,pA) are > CiA^/^, where Ci is as in Proposition 9.2, and let Zi = Z — Z2. The 
sets Zl, Zi are the inverse images of sets W\, Wi C (0, Aq) x A^ under the map sending 
a connection to its scale and center. If for each A G (0, Aq) we define W\^\ := {y E N \ 
{X,y) E Wi}, then Wi^x is contained in the union of a ball of radius < A^/^ centered 
at p and a similar ball centered at q, so VoI(W^i^a) ^ ^^. 

For the orthonormal set {(b„, a„, «„)} we may choose four elements of the type 
(b, 0, 0) and four of the type (0, *, *), all normalized to unit length. Then from (9.6) 
on Zl we have 

\Remt{Xi, . . . ,X„)| < A-^+^^ ■ {coefficient of 6^a^ in {b^ + baX'^/^ + a^A"^/')^} 

(9.9) 



< ),-6+45 



rs^ 



Hence from (9.8) 

p*% < [ X-'+^'X'dX dvoh < I \x-^+^' Mo\{Wi,y^))dX < Xt'; (9.10) 

or\Zi) JWi Jo 

the integral over the gluing-parameter space S^ gives a constant factor. 
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Similarly, on Z2, \Rem2{Xi, . . . ,Xn)\ < A^^ryi(j9)^^ryi(g)^^; the two distances 
''^a{p),i^a{(i) enter this way because in the Rem2 term in fi{p)Afi{q), two of the Rem2^s 
are evaluated at p and two at q (see (8.10)). Since r^(p)~^r^(q')~^ < ta(p) '^ + ^a(q')~*^ 
and on W^2 both r^(p) and r^(g) are > cX^'"^, we have 



/ P*^o < / A^rfA / A-V-Vrfr < / A| log A| rfA < A^-^^ (9.11) 



"Ao / ^diam(Af) \ ^Aq 

p:% < I 

Combining this with the integral over Zi and summing over i, 



^0 < A^^ (9.12) 



Case 2: Terms of the form F A Rem^. 

In this and the remaining cases, F{Xi^Xj) is computed either at p or at g, and 
since the Xj are cut off outside a ball of radius ~ A^^^ centered at pa^ for i >1 terms 
of the form F"" /\Rem^\Xi, . . . ,X^)\p^q vanish unless {\,Pa) hes in the set Zi defined 
in Case 1. All points pa in the remaining computations can thus be assumed to lie 
in a single one of our sets f/j, and / -^{z\ P*^'i ~ Iz ^'i- 

Note that all vector fields X,Y ei)A satisfy |X|, \Y\ < f3{b+a\~\A) < b+a\-^/^, 
and hence |F(X, y)| < |F|(6 + aA~^/^)^. Using the uniform bound (9.5) to estimate 
the three Rem2 terms, we obtain the pointwise bound 

\F A Reml{Xi, . . . , Xg)] < |F| ■ {coefficient of h'^a^ in 

{P + ^X-^/2)2y^-Z+,6p2 ^ ^^y^-l/2 ^ ^2;^-l/2)3| 

< |F|A-^+3^ (9.13) 

where F is evaluated at either p or q. Because of the cutoff in Xj we may assume 
that Pa is a distance < cA^^^ from whichever of these points at which we evaluate. 
Hence, using (6.11), 

r /-Ao / /-cAi/^ X -5+35), 2 \ /-Ao 

y/'i</ >^'dxU JyY^^^'drUj^ A^^l log A^A < A^^ (9.14) 



Case 3: Terms of the form F"^ A Rem\. 

Here there are two sub-cases, depending on where the points at which F and 
Rem2 are evaluated; we can have terms of type F{p)F{p)Rem2{q)Rem2{q) or of type 
F{p)F{q)Rem2{p)Rem2{q). In each sub-case we bound the Rem2 terms using (9.5). 
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At whichever point F{Xi,Xj) is evaluated, we can again assume r^ < X^^"^, so that 
\Xi\ < & + aX"^^"^. Letting p',p" denote either of p, q, we then have 

\F^ AReml{Xi,...,Xs)\ < |F|(p')|i^|(/) ■ {coefficient of 6V in 

(5 + ^^-1/2)4^-2+25(^2 ^ ^^^-1/2 ^ ^2^-l/2)2| 

< i\F\\p') + \F\\p"))X''+''. (9.15) 

Hence the integral of the different types of F'^Rem\ terms can all be bounded by the 
integral of \F\'^{p)X-^^'^^: 

r r\) ( fcX^I'^ \-4+2<5a4 \ rM 

Jz^'^l ^'"^M/o W+W'^'^'jl ^"'^''|l°g^l^^~^o- (9.16) 



Case 4: Terms of the form F^ A Rem2- 

In the previous two cases we were rather wasteful in bounding |Xj| pointwise; this 
time we must be more economical. 

Since p and q enter the problem symmetrically it suffices to deal with terms of 
the form F{p)F{p)F{q)Rem2{q). Temporarily write r^ = r^(j5), r^ = rA{q),Fp = 
\F\{p),Fg = \F\{q). Note that for our term to be nonzero, both r^ and Vg must be 
< cX^^"^. Using this fact several times we find 

\F^ A Rem2{Xi, ...,Xs)\ < F^Fg ■ X'^^^ ■ {coefficient of fe^a^ in 

{b + aX-\p)\h + aX-\gf{h'' + haX-^'^ + a'X-^/^)] 

< X'^^^FlFgrl + F^Fgrl) 

< A-^+^ (fX + F; + X'f; + A-F^S) . (9.17) 

We can now replace p by g and integrate over the region {{X,pa) | < A < Aq, < 
i"a{p) < cA^/^ as in cases 2 and 3. For each of the four terms A'lFpr'^ in parentheses 
in (9.17), one finds J^ A*(A^/(A^ + r'^Yyr^r^dr < const, so 



/ ^3 < / " A^A-^+^rfA < A^. (9.1^ 

J z Jo 



Combining the four cases, this proves that J^ ^2' < cAq (assuming Proposition 
9.2). 

Now define Rem2^'^{X,Y) to be the right-hand side of (8.6), but with ^x^^y 
replaced by the objects ^'xi^y of (Z5). The form Vt is obtained from Vt' by replacing 
each occurrence of Rem2 with Rem2^'^- Hence Q — Q' can be expressed as a sum of 
terms of the form F'^[Rem2^^ — Rem2yRem\ for appropriate i, j, k. By part (b) of 
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Proposition 9.2, the bounds on \{Rem2^'^{X,Y) — Rem2{X,Y)\ are of precisely the 
same form as in part (a), so the same argument as above shows that f^{Q — Q') < cAg, 
estabhshing (9.1). | 



10 The proof of Proposition 9.2 

The proof of Proposition 9.2 is long, so we outline the strategy. To obtain (9.5), 
we need a pointwise bound on GQ{Rem2{X,Y)) (see (8.6)). If there were a four- 
dimensional Sobolev embedding L^ ^^ C°, then modulo extra terms arising from 
Weitzenbock identities that occur when comparing objects of the form ||V'^V'^0|| to 
objects of the form A^0, we could get a C" bound on Rem2{X, Y) from an L^ bound 
on Rem2{X,Y). (This, in turn, would require some L^ and/or pointwise bounds on 
e) 

Of course there is no embedding L2 "^-^ C°, but since the failure is borderline, 
any stronger Sobolev-type norm should give an embedding into C". The most effi- 
cient Sobolev inequality for our purposes is the following one. This inequality is not 
surprising, but may not be widely known, so we prove it in the appendix (Corollary 
11.2). 

Lemma 10.1 (^Sobolev Embedding Lemma.^ Let E be a vector bundle over a 
compact 4-dimensional manifold N. For p E N, let Vp denote distance to p. Then 
for any 6 > there exists a constant c{6) such that for all connections V on E, all 
0G T{E), and all p e N, 

<c(5)(ll0ll2 + lkp'VV0||2). (10.1) 



Hence 

ML < c{5) 



<c((5)sup(||0||2 + ||r-^VV0|l ). (10.2) 



P • • T 112/ 



We will use this lemma to get pointwise bounds on = GQ{Rem2{X, Y)). Hence 
we will need to estimate ||G^a;||2 and ||r~'^V^V^Gi^C(;|| for to = Rem2{X,Y). For 
general u, Proposition 10.2 estimates these in terms of weighted L^ norms of u, 
providing bounds whose only dependence on the connection is explicitly through the 
center point and scale. (This type of uniformity in the connection is the hard part of 
all our elliptic estimates. Uniformity is important since to estimate an integral over 
a family of connections we cannot use any bounds that depend on the connection in 
an uncontrolled way.) Proposition 10.2 also provides similar estimates of objects C, of 
the form appearing in (6.14), which we need for reasons discussed below. 
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The pointwise estimates of GqU in terms of weighted L^ norms of general u will 
be summarized (and generalized) as part of Proposition 10.2, specifically the first 
half of (10.16). To apply these general estimates to uj = Rem'2{X,Y) we still need 
to bound the weighted L^ norms of Rem'2{X,Y). To understand what this entails, 
write Rem'^iX, Y) = Rem'^^^^ + Rem'^^^^^^^^ + Rem'^^^^^^^^^^, where 

^em2,i„, = R!'{X, F), i?em2 ,^^ii„, = {X, ^y} - {F, ^x}, Rem^^r^onXoc = {Cx, ^y} 

(10.3) 
(see (8.4)). Because of the cutoffs in X and Y , the expressions Rem'2 1^^ and Rem'2 scmiioc 
are supported in B{pA, 2e), but Rem'2 nonioc i^ '^ot. Thus among the estimates we need 
are weighted L^ bounds on R"{X,Y). By Lemma 10.4, below, pointwise we find 

\R"{X,Y)\<c\X\\Y\{P + e-\){\F\+rA\V^F\) (10.4) 

(recall that X is the object that the cutoff (3 multiplies in the definition of X). Here x 
is the characteristic function of the annulus e < r^ < 2e. Thus to apply the estimate 
(10.16) of Proposition 10.2 to obtain bounds on GQ{Rem'2{X, Y)), we need to estimate 
certain expressions of the form ||/5r~'^r^F|| , and similar expressions with F replaced 
by V^-F and/or with f3 replaced by x- This will be accomplished in Lemma 10.5, 
where we will list all the purely local estimates we need. 

Weighted L^-norm bounds on Rem'2 semiioc ^"^^ -Rem'g nonioc '^^'^ ^e obtained from 
weighted L^-norm bounds on X and ^. The first of these is another purely local 
estimate. The second will be achieved in Proposition 10.8, where we will use the basic 
elliptic tools in Proposition 10.2 to turn the problem into a local estimate again. 

Till now we have made no mention of the role the point p plays in affecting the 
weighted norms. If we compute these weighted norms as above and take the supremum 
over j9 G X as in (10.2), we obtain only the sup-norm bound (9.5) for Rem2{X, Y). To 
prove Proposition 9.1 we additionally need the pointwise decay bound (9.6). Since the 
local quantities we bound are supported near the center point pa of A, decay is only 
an issue for the nonlocal quantities, but these are built out of Green operators applied 
to quantities supported near pa- Thus one expects that as the distance between p 
and Pa increases, the bounds on our non-local quantities should decrease. This turns 
out to be true (at least for dist{p, pa) > const ■ A^/^); we simply have to work harder, 
establishing some general pointwise bounds in Proposition 10.3. Our basic estimates 
in Proposition 10.2 are most useful for p close to pa] to get the bounds that lead to 
(9.6), in which p is farther from pa, we will apply Proposition 10.3. 

To establish (9.6) we again break up Rem'2{X,Y) into its local, semi-local, and 
non-local pieces as in (10.3). In the cases of -Remgi^^ and Rem,'2 ^^^^i^^, Proposi- 
tion 10.3 again reduces our work to weighted L^ bounds of purely local quantities. 
For -Rem'gjjonioj., however. Proposition 10.3 leaves us with bounding an expression of 
the form \\r]^ {^x,^y}\\2, and the obvious approach — Holder's inequality and the 
weighted L* bounds already obtained — does not give us a strong enough bound for 
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an adequate decay rate in (9.6). We will circumvent this by obtaining a pointwise 
decay estimate for C,, which in turn gives us a satisfactory decay rate for Rem2. (In 
fact, with the pointwise estimate on ^ in hand, it turns out that the contribution of 
Rem'2 nonioc ^'^ Rem2 is much smaller than the bounds we obtain from the other two 
terms.) 

With this discussion behind us, our procedure is clear. First we will fill our elliptic 
toolbox by proving Propositions 10.2 and 10.3. To apply these we need to compute 
weighted L^ norms of various quantities appearing in Rem^., which is our next step. 
The final step is then a matter of bookkeeping, applying the general elliptic tools to 
bound G^ of Rem'^^^^^, Rem'^^^^^^i^^, and Rem'^^^^^^^^^^. 

To avoid writing similar hypotheses over and over, and for notational simplicity, 
for the rest of this section we impose the following 

Blanket hypotheses and notation. A always denotes a connection with [A] G 
-^fc+iA (^^^ Definition 6.2). Every Proposition, Lemma, etc. has an implicit hy- 
pothesis "for Ao sufficiently small and for all [A] G -^^+1 Ao fl-^'^'''"- Constants c 
are uniform in A and in all other relevant parameters not explicitly shown (though 
some would depend on k and z/, if these were not fixed); e.g. c{6) depends only on 6. 
Constants are continually updated, and when a hitherto unnamed c appears, there 
is an implicit "for some constant c" . The notation "x < y" means x < cy. F always 
denotes the curvature of the conection A, and V = V"^ denotes the full covariant 
derivative on r{Ad P^A*T*N) (the tensor product connection determined by A and 
the Levi-Civita connection). Given any vector field X on A^, we write X = lxF (thus 
there is an A-dependence we suppress). We write pa for the center point of A and A 
for \{A). For any p & N, we let Vp denote distance to p, and write r^ for r^^. When 
a point p appears in a hypothesis, the letter d always means r^(p) = dist{p,pA). The 
scale e = const X^^"^ and cutoff P = /5std(^A/e) are always as in (6.8), and x denotes 
the characteristic function of the annulus {e < r^ < 2e} containing the support of 
dp. We also define the operators V = V^ : n\Ad P) -^ 9P{Ad P) © n\{Ad P) by 

V'r] = (^{dyr], V2dir]^ ; (10.5) 

thus ker('D'^) = H\, the harmonic space in the middle of the elliptic complex 

^ Q\Ad P)^Q\Ad P)^Ql{Ad P) -^ 0. (10.6) 

Define A^, A^, A^ to be the Laplacians on zero-forms, 1-forms, and SD two-forms, 
respectively, constructed from this complex, and let Gq, G^ be the inverses of A^, A^. 
Also define A^, G^ on n^{Ad P) ® ^l{Ad P) by A^ = A^ © A:^, G^ = G^ ® G^. 
Note that 

(I)^)*(0o,0+) = rf^0o + V2(rf+)>+, (10.7) 

51 



so that the quantity C,x = Cx of (6.14) can be written as 



Finally, observe that 



^x = {vyG^V'x 



v^'ivy = A^, {vyv" = a( 



(10.^ 



(10.9) 



Now we can finally begin proving Proposition 9.2. In the following proposition, 
what drives the estimates are two facts: (i) A^ is uniformly bounded below, and (ii) 
in the Weitzenbock identity for A^, only Riemannian curvature terms appear; F does 
not enter. 

Proposition 10.2 For 6o > sufficiently small and any 5, 5', 5" (possibly zero) of 
absolute value less thand^, such that for any p G A^ and anyuo G VlP{Ad P)®^\{Ad P) 



\Gi^h < Wrl-^'ujh 



and 



r„ V V (jr^tc; L >- r„ cj h + ^ 



S'-l\\„l-5-5' 



^1 2- 



(10.10) 



(10.11) 



Furthermore, if ^ = {V^yC^uj G Q\Ad P) (cf. (10.8)), then 






2 ' 



A "^lU ^ W A 
12 ' W P ^lU 



\r-^-^f\\ + llr^^f II + llr^'^V^^II < ||r"^cj|| + A' 



2 '^ 
2 



S'-l 



r\-'-''uj 



(10.12) 
(10.13) 

(10.14) 



and 



r-^V^^II +||r"'^V^V^^|| < ||r"'^(r'"^)*cj|| +A^ 

P ^lUIIP ^ll2^^IIP^ 'II2 



r-/-^'. 



+ X 



<5'-2 



r\-^-^u 



10.15) 
As a corollary of Lemma 10.1 and (10.12-10.15), if 60 > is sufficiently small and 

< 5 < 5o) then 
\Giuj\{p) < c{6) ■ {RHS of (10.11)} and \^{p)\ < c{6) ■ {RHS of (10.15)} . (10.16) 

We remark that in (10.11), (10.14), and (10.15) it is important that r^ appear 
where it does, rather than Vp, or we would not get strong enough estimates in our 
applications. The fact that both Vj^ and Vp appear together in Proposition 10.2 
complicates its proof. 

Proof: A slightly less general set of bounds was derived in Lemma 3.3 of [G2] for A^, 
the Laplacian on SD two-forms only, but for the reasons mentioned prior to stating 
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the proposition, essentially the same proof works here. The only differences are that 
(i) in [G2] the decay (6.11) was true on all of A^, not merely in i?(j9^,2e), and (ii) 
[G2] dealt only with the case p = pa- Since the cited proof is rather long, we will not 
repeat the parts that require only minor modifications, and will jump to the points 
of departure. 

To establish (10.10), the proof of Lemma 3.3a in [G2] works verbatim to show 
that 

\\r;'-'Giiu\\, + ||r/G^c.||4 + ||r, ^VG^c^lh < llr^^+^c^lh. (10.17) 

Note that 5 need not be positive here. Since IGqCu] < \rp~^G^uj\, (10.10) follows. 

Moving to (10.11), let rj G Q*{Ad P) be a form of arbitrary degree. The procedure 
in [G2] for proving its Lemma 3.3b,c — squaring, integrating by parts, commuting a 
covariant derivative past a trace Laplacian (V"^)*V'^ = V*V, and juggling terms — 
leads to 

Ik^'Vr^lU + \\rp'VVr]\\2 < c (||r;'^V*V77||2 + Wr^'vh + Wr^'Vrjh 

+ \\rA^''r;'FA\U\\rA'''''^v\\2) ; (10.18) 



here the smallness of \5\ has also been used to ensure that the term |5| ||r ^ ^Vr7||2 that 
initially comes up on the right-hand side is < l*^! lkp'^VV?7||2; see [G2], Lemma 3.2. 
(In [G2] there was no need to insert r^ .) First consider the case r] = G^tu, where 
iu e n^{AdP) © nl{AdP). The Weitzenbock formula gives A^r/ = u + n{G^uj), 
where TZ is an endomorphism proportional to the Riemann tensor. Moreover we 
will see in Lemma 10.5b below that for 5, 5' sufficiently small, \\r^ r~'^F4||4 < A^ '~^. 
Inserting these facts into (10.18), one can continue the argument as in [G2] and arrive 
at an extended version of (10.11): 

II -<5-1y7^A II I Wr-^Tjr'^t ill -L \\r~^T7T7r'^t ill < Wr^^t ill -L \^'~^\\r^^^^^'t ill 

(10.19) 

As for (10.12), since \T>*ri\ < clVr^l, the desired estimate follows from (10.17). 
By similar manipulations one can also establish 

\\r'/+'VGiLo\U + ||r;,^+VVG^c.||2 < \\r\''ujh. (10.20) 

Since V is V"^ followed by a covariantly constant projection, the same bounds hold 
with VG^u; replaced by 'D*G^uj = C,, yielding (10.13). For the same reason, (10.14) 
follows from (10.19). 

Finally, to establish (10.15), return to (10.18) and use the Weitzenbock formula 
for 1-forms, 

V*Ve = Afe + ^(0+^(0- (10.21) 
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Here T is an endomorphism proportional to F . Since Af,^ = V*V{V*G^uj) = V* 
(see (10.9)), we have 

||r-^V*Ve||2 < c (\\r-^V*u\\^ + \\r-^^\\^ 
Hence 



iU 



rS+s'r-Sp 



rT'i 



10.22) 



-5 



VeiU+llr-^VVeib < c( 









10.23) 



Once again \\r^ r ^F\\4 < A'' ^, and (10.19) (with VG^co replaced by ^) implies 



r-/-'^ 



r-/-^'V^ 



5-5' 



<c{ r-Z-'u 2 + X 



-l+5"|i -5-5'-5' 



'ujh) ■ (10.24) 



Using (10.20-10.19) to bound the other terms in (10.23), the bound (10.15) follows. 



Proposition 10.2 gives the same bounds for all p E N; to obtain (9.6), we need 
estimates that show decay as d = r^(p) grows. The following proposition provides 
these estimates. We separate the estimates into cases (a) and (b) below because for 
many purposes the only cj's for which we need to estimate the quantities in Proposition 
10.2 are compactly supported in a 2e-ball around p^, and we get sharper estimates in 
this case. Part (a) will thus be used to bound the terms GqR"{X, Y) and Gq{X, ^y} 
in Rem2{X,Y)] part (b) will be used to bound Gq{^x,Cy}- 



Proposition 10.3 Notation as in Proposition 10.2. There exists (5o > such that 
the following are true. 

(a) Suppose that for some cq (not necessarily related to t = cX^^"^ , and allowed 
to depend on u), (i) supp(c<j) C B{j)A,eo), (H) d = r^(p) = dist{p, pa) > 2eo, and 
(Hi) \F^\ < B on the complement ci/supp(cj). Let P be a cutoff function of the form 
/3std(4rp/(i) (so that supp(/9) C B{p,d/2)). Then for any 5' with \5'\ < 5q, and any 



\GUip) < c{6)d-'-'-'' {1 + B'/') 



rr- 



and 



rr- 



mp) < c{5)d-'-'' {d-^ + B) 
Thus, z/supp(c(j) C -B(pAi2e) and d > Ae = cX^^"^, then using (6.10), 

,1+5', 



\Giuj\{p) < c{5)d-'-'-'' 
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rx u) 



;i0.25) 
10.26) 

;i0.27) 



and 



mp)<c{5)d 



-2-5-5' 



tY^'u; 



;i0.28) 



(b) Suppose only that {F"^] < B on B{p,d/2), where d = dist{p, pa) > 0; suppose 
nothing about the support of uo. Let j3 be as in (a). Then for all 5' with \5'\ < 5q, we 
have 



\Gloo\{p)<c{6){l + B^/^)(d-^ 



-i-a-5' 



r\^^'u 



+ 



r-'Puj 



(10.29) 



Thus, if d > cX^^"^ , then 

IGUip) < c{6) (d-'-'-'' 



tY^'lo 



r-'Pu 



^10.30) 



Proof: (a) We will app' 

r;'VV0Giiu)h, PGiu 



y the Sobolev inequality (10.1), but first we must bound 
, and similar expressions with G^u replaced by ^. 



ii] First we will show that 



VV(;3G^c.)|h <d-^-'-'\l + B'l^) 



rr- 



'10.311 



Let Tj G VP{AdP) © Vt\{Ad P). Proceed as in the proof of Proposition 10.11 — 
squaring, integrating by parts, etc. — but this time leave the term proportional to F 
(which arises from commuting V^ past a trace-Laplacian) in integrated form. One 
arrives at 



-5 



VVr^Wl < \\r^'Ar^\\l+ \\r^'r^\\l+ Wr^'VvWl J r^''\F\\Vv\' (10.32) 



where A = (V )*V . Now replace rj by Prj. In the integral we have \F\ < B, so 



kp vv 



<c{\\r;'A{Prj)\\l + \\r: 



+ (l + B)\\r~'V 



(10.33) 



An integration by parts, plus various steps already seen in the proof of Lemma 10.2, 
gives 






2< 



r/A(^77) r;'Pr^ 



<c{ k 



r;'A07i) 



+ k'' 



r;'Pv 



for arbitrary k. Inserting this into (10.33), with k < {1 + B) ^, we find 



|r;^VV 



2 ^ ll'p ^ 



+ (l + 5)||r->|| 



(10.34) 
(10.35) 



55 



Using the Weitzenbock formula as in the proof of Lemma 10.2, we can replace A by 



A^, absorbing the zeroeth-order term into (1 + 5) 



rp^(3v 



we have 



rp^(3r] 



< 



r-'^im 



. Hence 



k; VV 



<(l + 5^/^)||r;^A^ 



. Additionally, by (10.17) 



;i0.36) 



Next, note that for any function /, 

|A^(/r^) - /A^T^I < (|VV/||7^| + |V/||Vr^|). 



;i0.37) 



Apply this with f = P and r] = G-^u, noting that by the hypothesis on the support 
of u we have PA-^r] = Pr] = 0. Since \V^P\ < cd~^ , and since on the support of V/3 
we have both d/2 < r < d and d/2 < r^ < 3d/2, we obtain 



\r;'Ai0r^)\ < cr;'x{d-'\v\+d~'\Vv\) 



< cd' 



^l-S-5'~/.-l+5' 



xir'/^''\v\ + ri\Vv\) 



^10.38) 



where x denotes the characteristic function of the annulus d/4 < r < d/2. Inserting 
this into (10.36), we have 



k; vv 






'a '/ 



Xr-A^ri 



TaVti 



Now apply (10.17) to obtain 

||r;^VV(/3r/)||2 < (1 + B^/^)cd-^-'-'' 
which leads to (10.31). 



rr^ 



(10.39) 



;i0.40) 



Moving on to 
5 = 0, we have 



PGiuj 



and repeating some of the steps in the proof of (a) with 



pCiu 



< 



AiiPGiu;] 



< d- 



l-S' 



l+5'^yl, 



Xr~/^' G^u 



rVio 



+ 



,~.^<5' 



Xt'aVG^u 



This is smaller than the bound (10.31), so (10.1) gives (10.25). 



;io.4i) 



The bound (10.26) is derived by methods similar to the preceding and those used 
in Proposition 10.2. We leave the details to the reader. 
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(b) Proceed as in (a); the only change is that no we no longer have fioo = 0. 
The first effect of this change occurs in (10.38), where we have to add |r~"/9u;| to the 



RHS. The effect of this term is to add (1 + B^''^) 

and (10.40), hence to (10.31). There is a simih 
but its effect is smaller than the preceding one. 



r-"/5cu 



to the RHS of (10.39) 



2 

and (10.40), hence to (10.31). There is a similar change in the bound on ||/3G^C(j||2, 



To apply Propositions 10.2 and 10.3 to estimate Rem2, we need to estimate 
expresssions of the form ||r™a'||2 for various m, where u = i^em'a jo^,, -Rem'g ggj^iioc? o^ 
i?em2nojjioc (see 10.3). First we deal with the purely local object Rem2ioc{^y'^) = 
R"{X,Y). To start, we need a pointwise estimate, given by the next lemma. The 
conclusion of the lemma is deceptively simple; the way in which the derivatives of X 
and Y are coupled to each other and to V-F in the definition of Rem2 is crucial. 

Lemma 10.4 For X,Y e 1)a, 

\R"{X,Y)\<\X\\Y\{(3 + e-\){\F\+rA\VF\). (10.42) 

(Here X,Y are the un- cutoff' versions of X,Y; see (6.8).) 

We remark that for general vector fields, this lemma would be false. 

Proof: Let = R"{X,Y). From (8.4) we have R"{X,Y) = f3^R"{X,Y)+ terms 
involving the derivative of (3. The latter are easily dealt with, giving the terms 
proportional to x ^^ (10.42). For R"{X,Y), the first three terms in (8.4) have norm 
bounded by |F|(|X||y| + |AX||y| + |X||Ay|), and an easy computation shows that for 
X G i)A, \^X\ < |X|. Furthermore, because F is ASD (and hence Yang-Mills as well) 
and the "rotational" parts of X,Y are SD, the remaining three terms in R"{X,Y) 
would vanish if the metric on N were Euclidean. When we do the bookkeeping 
necessary for the 0{r\) difference between the metric coefficients Qij and Sij, the we 
obtain contributions bounded by |X||y|(|F| +rA|V-F|). | 



Thus, bounding \GqR"{X,Y)\ pointwise boils down to estimates of the form in 
the following lemma. 

Lemma 10.5 Let p E N he arbitrary and let d = dist{p, pa)- Then we have the 
following estimates. 

(a) Assume < 6 < 2 and n > —2 + 6. Then 

( A"-^ n-5 <2, 

\\Pr;'rlF\\^+pr^'rl^'F\l+\\Pr;'rl+'VF\\^<{ X'\\ogX\'/' n - 6 = 2, 

( Ai+("-^)/2, n-6>2 

(10.43) 
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and for all n 

\\xr;'rlF\\^ + \\xrp'rl+'VF\\^ < A^-^+^Z^. (10.44) 

1 /2 

(b) Let Co > Aq be some fixed number. For < 6 < 1 and —l + 6<n<3 we 
have , , , 



Proof: (a) First consider the case 6 = 0. From (6.11) one quickly finds 

r A" -2 < n < 2, 

||/5r:iFl|2 + ll/^r^'i^lL < \ ^'1 log^l'/' ^ = 2, (10.46) 

[ ;^i+n/2 ^ ^ 2 

and for all n, 

Wxr'XFW, < A^+^/l (10.47) 

As for ||/5r^'''^VF||, the same argument as in the proof of [Gl, Lemma 3.3b] shows 
that 

Wf^rT'^m ^ ^ Wf^^AFWl + \\PrT'F\\l + \\mrl+'F\\l + J f3'rf+'\Ff. (10.48) 
Since \dp\ < e"^X) "we have Irf/SIr^"*"^ < ce"x- Thus 

WPrT'^^Fl ^ Wf^^AFh + A"/' \\XF\\, + {^j P'r^^'W?^ ■ (10.49) 

From (6.11) one can deduce 

/ /• N 1/2 f A", n < 3, 

/A^"+2|Fn << A3|logA|i/2, n = 3, (10.50) 

\J J y ^3n/2-3/2^ ^>3^ 

Combining this with our previous bounds we find that 

ll/^r^^+VFJI^ < RHS of(10.46) (10.51) 

To bound Hxr^^^VFJI^, again use the analysis leading to (10.48), but with (3 replaced 
by a smooth extension of x of the form f{rA/t) with / supported in [1/2, 3]. (It is sim- 
plest first to note that since ta < 2e on supp(x), ||x^a''^^VF||2 < A^^+^^Z^ ||xVF||2,) 
Then analysis similar to the above leads to 

llxr^^+VFJI^ < RHS of(10.47). (10.52) 

This completes the case (5 = and we move on to the general case. 
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We first bound ||/3rp"'^rJF||2; the method for bounding ||/3rp''r^^^F||4 is identical 
Break the ball B{pj^, 2e) into two pieces: an inner region Bin = B{p, d/2) f] B{pa, 2e) 
and an outer region Bout = B{pA, 2e) — Bin- On Bout, we have Vp > d/2, and hence 
rA/r-p < {vp + d)/rp <3. Thus 



rp Ta 



F\ = {rA/rpfrl-'\F\ < r^'l^l, (10.53) 



implying 



( A"-^ -2<n-6 <2, 

< \ A2|logA|i/2, n-5 = 2, . 

( \^+(^-^y^, n - (5 > 2 

(10.54) 

For the integral over Bin, first suppose n — 6 < 2 and separately consider the 
cases d < X, d > X. In both cases note that d/2 < ta < 3d/2 in this region. When 
d < X, we then have ta ^ X and \F\ < A~^ on Bin, so 

ll/3r;V:^F||L^(B.„) < A"'^||r;i^.(^,„) < V^-'d''' < A^^. (10.55) 

On the other hand if rf > A, then since r^/ci is bounded above and below on Bin, 
(6.11) implies r^|F| < AV^"^ < A^rf"-^ Hence 

||/5r;V:iF|U.(B,„) < X'd-''\\r;%.^B.^) < X'd-''-' < A"-^ (10.56) 

since n — 6 < 2. Combining this with the estimate for Bout we obtain the top two 
lines of (10.43). 

li n — 6 > 2, separately consider the cases d < 4e and d > Ae. If rf < 4e, the 
procedure for the case 6 > X above yields 

||/3r;V:iF|U.(5^„) < A"-lr/|U.(,,,„) < X'd^-'-' < X'+^--'^l\ (10.57) 

the same bound as on Bin- If rf > 4e then on the support of (3 we have Tp > e, so 
r~^r^\F\ < A~'^/^r^|F|. Thus (10.46) yields the remaining case of (10.43) for the 
bound on Hr" '^r^F||2. 

The method for bounding ||/3r~'^r]^"^^V-F||, is essentially identical to the method 
for bounding ||/5r~'^r^''""'^VF||2, except that for the estimates over Bin, first multiply 
by a cutoff function of the form /3std(2rp/(i), and then integrate by parts as in (10.48). 

To bound llx^p'^^A-^IL' ^^^^ that on supp(x) we have \F\ < const and r^ < e, so 

\\X'p 'A^\\r^^ \\X'p II2 ~ ^ W'p llL2(B(p,2e)) ~ "^ • l^iU.dOJ 
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Similarly ||x^p'^^^V-F|[ < A"/^ ||x^„'^VF[| , and the same procedure as for 5 = 
completes the work. 



(b) First write 



\ p ^ lU — 



[i-pyr^iF 



+ 



Prl-'r-F 



10.59) 



where /5 is a cutoff of scale eo centered at pa- On the support oi 1 — j3 we have 
|F| < const, so the first term on the RHS is bounded by a constant. The second term 
can be estimated as in the proof of (b). | 



We are now in a position to bound IG^Rem'^i^^l pointwise, but we postpone this 
until we have collected the estimates needed to bound the semi-local and non-local 
contributions to G^Rem^- These require bounds on norms of ^x = T>*G^T>X^ which 
in turn require pointwise bounds on VX. 

Lemma 10.6 For any vector field X on N , and any ASD connection A, we have the 
pointwise formulas 

(d^TixFA = {d+X\FA) (10.60) 

and 

diiixFA) = SYmliVX*nFA, (10.61) 

where X* is the metric dual of X , d^X* is the self- dual part of dX* , SymQ(T) denotes 

the traceless symmetric part of a rank-two tensor field T G T{T*N Cg> T*N), and, 
in a local orthonormal basis 9'' of the cotangent bundle, (T, F) = ^TijFij G and 
T^F = TijFjkO' A 9''. Hence 



and 



\1^^i^xFA)\ < {\d+X*\ + |Sym2(VX*)|)|F^| (10.62) 



iV^'V^iixFA)] < c((|V(rf+X*)| + |V(Sym2(VX*))|)|F^| 

+ (|rf+X*| + |Sym^(VX*)|)|F^|) . (10.63) 



Hence if X E i)A, then 



and 



\V^X\ < {rAP + e-'x)\X\ \Fa\ (10.64) 



\\{V^YV^X\ < c\V^V^X\ < {P + e~\)\X\ (|F^| +r^|V^F^|). (10.65) 
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Proof: Using the facts that that d* = — * d*, d"^F = 0, and *F = —F, we have 

{d^iixF) = - * d^{*tx{*F)) = *d^{X* AF) = *{dX* AF) = {dX*, F); (10.66) 

this gives (10.60). Now fix p G A^. Calculating in a local orthonormal frame {cj} of 
TN and dual coframe {6^} with Vej|p = 0, 

= 5^(V,X,)p+(^^A.e,F) 

= p+ (Sym^(VX)ttF) (10.67) 

by Lemma 2.3 of [Gl]. Since for any symmetric 2-tensor T, the pure-trace part of T 
yields a se//-dual 2-form under the operation jjF, we may replace Sym^ by Symg in 
(10.67), and by simple representation theory, the p+ in (10.67) is redundant. 

(10.64-10.65) follow from Lemma 10.6 and a pointwise computation of d+X*, 
SymQ(VX*) that we leave to the reader. | 



Corollary 10.7 For all X G i)A, and all p E N, the elements X G Ha satisfy the 
following integral bounds. 



(a) If -1 <m <2, then 



rJX 



< c{m)X 



m— 1 



'10.68) 



(h) If —1 < m < , or if d = dist(p, p^) ^ e and —l<m<2, then 

<c(m)A™/'(6-A^/2 + a). (10.69) 



rri^^x 



(c) For all 5 E (0,1), 



rp\vyvx 



< c 



r^^V^V^X 



< \-\b + a ■ \-'/'] 



(10.70) 



Proof: Using (10.64-10.65), plus |X| < b + aX'^r^, most of these bounds follow 
directly from Lemma 10.5. The exception is (10.69) in the case m > 0, for which one 
must also use the triangle inequality Tp < rA + d < rA + A™/^. | 



We are now in a position to derive our final estimates on the norms of ^ needed to 
md IG^Remi 
to prove (6.13). 



bound |G^-Rem2sgjjjjioj,| and \GQRem'2^^^i^^\ pointwise. We also use the opportunity 
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Proposition 10.8 There exists 60 > such that if < 6 < 60 and < 6' < Sq, then 
for ^ = (V'^yG^'D'^X (with X e ^a) we have the following. 

(a)IfO<6< 5o then 

\^{P) I < A-^' {b-X-' + a- A-3/2) . (10.71) 

If furthermore d = r^ip) > 4e = cX^^"^ , then 

\m\ < c{5)rA{p)-^-'-''X''ib ■ A + a ■ X'/^). (10.72) 



(b) 

Uh<\''^\b.X + a.X'/') 

Since C,x = X — ttaX , this implies (6.13). 
(c) If0<5 < 5o then 



10.73) 



(10.74) 



(d) If\6\< 60 then 



a+<5 



elL ^ X'/\b-X + a-X' 



/2^ 



(10.75) 



Proof: (a) We will omit writing the (5-dependence of the constants. From (10.15) 
given 6, 60 as above there exists 6' > such that 



\m\ 



^ \\rp [a^) uj\\^ + A 



^-S-25' 



UJ 



+ A 



2(5'+<5-2 



\-25-25' 



UJ 



10.76) 



where oj = V'^X. Using Lemma 10.7, we compute 



A 



25' -1 



,-5-25' 



LU 



+ A 



2^+5-2 



,1-25-25' 



LU 



<X^'ib-X-^ + a-X-^/^). (10.77) 



The bound on ||rp'^((i^)*c<j|| from Lemma 10.7 is smaller than this, so we obtain 
(10.71). 

For (10.72), apply (10.28) and Lemma 10.7. 

(b), (c), and (d). Apply Proposition 10.2 and Corollary 10.7. | 



We remark that by using the pointwise decay estimate (10.76) one can obtain the 
weighted L"^ decay 

\K'\U < X-'r-Aipr'ib + a ■ A-V2) (10.78) 

for d > cA^/^, but this is of no help to us. 

We're now ready to collate all the estimates needed to prove Theorem 9.2. 



62 



Corollary 10.9 (a) There exist So > 0,6' > such that for < S < 6o, the following 
are true. 



\\r^'Rem',^,,^{X,Y) 
y/'Rem',^,,^{X,Y) 



2,scmiloc 



\\rp Rem'2 
\r^^^ Rpm' 



|rp Remi, 



2,nonlocl^5 ^ ) 

I 

2,nonloc 



|ri/^i?emf,_,„,(X,F) 



2 < \-'l\l?^-ha-\-^l^^a^.\'^). 

1^ < X^\b' + ba-\~'/' + a'.\~'/'). 

\ < \^'(b' + ba.\~'/' + a'.X-'). 

< x'+''{b' + ba-X~'/' + a'.X-') 



(b) There exists 6' > such that for all p E N , the following are true. 
\G^Rem'{X,Y)\{p) < X-'/'+'' (b' + ba ■ X''/' + a' ■ X-'). 



IG^Rem, 



2,sciniloc 



{X,Y)\{p) < X-'+''{b^ + ba-X-'/^ + a'-X~'/^). 



\GtRem'^^{X,Y)\{p) < X'' {b' + ba ■ X~'/' + a' ■ X''). 



(10.79) 
(10.80) 
(10.81) 
(10.82) 
(10.83) 
(10.84) 



(10.85) 
(10.86) 
(10.87) 



(c) There exists 5q > 0,5' > such that if < 5 < 5o and d > Ae = cA^^^ the 
following are true. 

\G^Rem'2^,^^{X,Y)\{p) < d'^'^'^' X^' {b'' ■ X^'^ + ba + a" ■ X'^/^) (10.88) 

|G^i?em'2,_n„,(X, Y)\{p) < d-'-'-'' X'' {b' + ba ■ X'^" + a^ ■ X-^'^). (10.89) 

\GtR^K,nonXo.{X,Y)\{p) < d-^-'-^'X^'ib-^-X + ba-X'/' + a'). (10.90) 



Proof: (a) These bounds follow directly from Lemma 10.4, Lemma Corollary 
10.7 the L^ bounds in Proposition 10.8, and Holder's inequality. 

(b) Use part (a) and Lemma 10.3. 

(c) Since Rem'2 loA^i^) ^"^^ -^^"^2scmiioc(^5 ^) ^^6 supported in B{pA,'2,e), for 
these terms we can apply (10.27) and the corresponding bounds in (a). As Rem'2 scmiioc(^' ^) 
is not locally supported, we appeal instead to (10.30): 



where /9 is a cutoff of scale d/2 as in Lemma 10.3. 



;io.9i) 



If we estimate 



^A^ {'C? O using (10.84) we obtain the right-hand side of (10.90). 
2 



Were we next to estimate 



r-'P{^,0 



, the resulting bound would be to large to 
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be of use. Instead, since d < ta < Sd on the support of P, we can use the pointwise 
bound (10.72) to find 



r;'P{^,^} 



< 



r-'P 



d 



-2-6-5' X r/ 



\5'{b-\ + a-\'/^)y 



< d-^-^'~'''X^''ib^ -X' + ba- A3/2 + a^.\) 

< d-'-'-''\^/^-'"{b^-+ba-\-'/' + a'-\-'), (10.92) 



which is much smaller than our bound on 



^A {^>0 • Thus (10.90) follows. ■ 



Finally, we have the 



Proof of Proposition 9.2. (a) Add the bounds (10.85-10.87) to obtain (9.5). If we 
add the bounds (10.88-10.90) we obtain a stronger bound than (9.6): 



\Rem2{X,Y)\ < r^^-^' \^' {b^ + ba ■ \-^ 



-1/2 



A-i/2) . 



(10.93) 



(b) In the proof of part (a), the only way in which ^ entered was through the L^ 



bounds on ||rp^4||4, ||r^ ^iWi, and the pointwise decay (10.72). Hence our assertion 
follows from the hypothesis (Z5) of section 7. | 



11 Appendix 

The point of the following weighted Sobolev inequality is that on an m-dimensional 
manifold there is no Sobolev embedding L™ "^-^ L°^, but the failure is borderline. 
Thus by introducing an arbitrarily small weight into the Sobolev norm we are able 
to obtain an embedding. 

Lemma 11.1 Let E -^ N be a Riemannian vector bundle with metric- compatible 
connection V, where N is compact, Riemannian, and m-dimensional (m > 1). Given 
p E N and R2 > Ri > 0, let Q{p; Ri, R2) denote the annulus {Ri < rp < R2}, where 
Tp is distance to p. There exists a constant c such that for any 6 > 0,R2 > Ri > 
(but smaller than the injectivity radius), any p E N , and any G '^{E) we have 



\<P{p)\ < cr^i-^/™)/?^ 



Consequently, 



.-&, 



-<5t 



/?2 -i?i II P ^\\L^{Q.{p;RuR2)) 



+ k; V0 



lL-(i?fl2(p)) 



11.11 
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< r(i-i/™) (||0||,™(^) + ||r;^V0||^„(^^) 



:il.2) 



and 



L°°{N) 



< cr(i-i/"^) 



L"^{N) 



+ =up(||r-V^|I,„.,„,)). (11.3) 



Proof: By Kato's inequality, it suffices to prove this for the trivial real line 
bundle, i.e. for functions on A^. 

First replace N by R™ and consider a compactly supported function / G C^{B{0, R)). 
Let 6 G S""^^. Then, using polar coordinates on B{0,R), we have 



1/(0)1 



R 



dr 



r, 6)dr 



< I \Vf\{r,9)dr, 



:ii.4) 



implying 

Vol(5— 1)1/(0)1 < 



[ de( [ \Vf\{r,e)dr] = [ iVfl-r^-'^dvol. (11.5) 



Applying the same argument on a normal-coordinate ball B{p,R) in N (where / G 
C^{B{p, R))), using the compactness of N to get uniformity in the constants below, 
we obtain 



\f{p)\<c f iVflrl-'^dvol 
Jb{p,r) 



r'^\Vf\rl'"'+^dvol 



B{p,R) 



< cWr^VfW \\r 



1 — ?Tt+<5 I 






L"^{B(p,R)) • 



L"^/<."^-'^){B{p,R)) 

(11.6) 



Now remove the assumption that / is supported inside a normal coordinate ball. 
Replace / in the preceding argument by f3{r)f, where /5 is a cutoff function identically 
1 for r < Ri and vanishing for r > R2; thus |V/9| < c/(i?2 — Ri)- We then have 

\fip)\ < cr(i-i/-)i?^||r^-^v(/?/)||^„ 

< c5-^'-'/"^^Ri {\\r;\VP)f\\^^ + \\Pr;'Vf\\,^) 



< c(5'(i-i/")i?^ 



1 



i?2 — Ri 



\ p •'I 



L^{n{Ri,R2,p)) 



Vp'^f\ 



L--(BrJp)) 



:ii.7) 



yielding (11.1). Taking R2 = 2Ri to be, say, half the injectivity radius of A^, we 
obtain (11.3). | 



As a corollary, we have 
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Corollary 11.2 Let i5, A^, V be as in Lemma 11.1, and assume dim(N)=4- Then 
for all 6 G (0, 1), there exist constants c{6) such that for all G ^{E), 

(11.8) 



<c(5)sup(||0l|2 + ||r/VV0|| V (11.9) 



|0(p)| < c{6) [HhHN) + Ikp VV0||^,(^^^ 
and hence 

Proof: Applying Lemma 11.1 with m = 4, we have 

|0(p)|<c(5)(||0||4+||r;^V0||^) (11.10) 

Using the Sobolev embedding Ll{N) "-^ L'^{N) we then find 

|0(p)| < c{5) (1I0II2 + \\r;'-'V42 + Ikp'VV^^IL) (11-11) 

But since 5 < 1, we also have the weighted Sobolev inequality of Lemma 3.1 of [G2]: 

\\r~^''n2 < c{\\r-'^h + llr-'^V^Ih) (11.12) 

(the proof is again a polar-coordinate computation). Using this we can bootstrap 
(11.11) into the form (11.8) . | 



References 

[Dl] S. K. Donaldson: Connections, Cohomology and the Intersection Forms of 
Four Manifolds. J. Diff. Geom. 24 (1986), 275-341. 

[D2] S. K. Donaldson: Polynomial Invariants for Smooth 4-manifolds. Topology 29 
(1990), 257-315. 

[D3] S. K. Donaldson: The orientation of Yang-Mills moduli spaces and 4-manifold 
topology J. Diff. Geom. 26 (1987), 397-428. 

[DK] S. K. Donaldson and P. Kronheimer: The Geometry of Four-Manifolds. Oxford 
University Press, Oxford (1990). 

[Gl] D. Groisser: Curvature of Yang-Mills moduli spaces near the boundary, 1. 
Gommun. in Anal, and Geom. 1 (1993) 139-216. 



66 



[G2] D. Groisser: Totally geodesic boundaries of Yang-Mills moduli spaces. Houston 
J. Math., to appear. 

[GPl] and T. H. Parker: The geometry of the Yang-Mills moduli space 

for definite manifolds. J. Diff. Geom. 29 (1989) 499-544. 

[GP2] : Sharp decay estimates for Yang-Mills fields. 

Commun. in Anal, and Geom., to appear. 

[GPS] : Differential forms on the Yang-Mills moduli 

space. Unpublished notes. 

[KM] P. Kronheimer and T. Mrowka: Embedded Surfaces and the Structure of Don- 
aldson's Polynomial Invariants, J. Diff. Geom. 41 (1995), 573-734. 

[SI] L. Sadun: A simple geometric representative for /x of a point. Gommun. Math. 
Phys 178 (1996) 107-113. 

[S2] : Simple type is not a boundary phenomenon. In Geometry, Topology 

and Physics: Proceedings of the First Brazil- USA Workshop, Apanasov et al 
ed., Walter de Gruyter, Berlin (1997). 

[T] C.H. Taubes: Metrics, Gonnections and Gluing Theorems. CBMS Regional 
Conference Series in Mathematics 89, American Mathematical Society, Prov- 
idence (1996). 



67 



